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Abstract. We show that a perturbation of any fixed square matrix D by a 
random unitary matrix is well invertible with high probability. A similar result 
holds for perturbations by random orthogonal matrices; the only notable excep- 
tion is when D is close to orthogonal. As an application, these results completely 
eliminate a hard-to-check condition from the Single Ring Theorem by Guionnet, 
Krishnapur and Zeitouni. 
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1. Introduction 

1.1. The smallest singular values of random matrices. Singular values cap- 
ture important metric properties of matrices. For an x n matrix A with real 
or complex entries, n < N, the singular values Sj{A) are the eigenvalues of |A| = 
{A*A)^/'^ arranged in a non-decreasing order, thus si{A) > ...Sn{A) > 0. The 
smallest and the largest singular values play a special role. si{A) is the operator 
norm of A, while Sinin(^) := Sn{A) is the distance in the operator norm from A to 
the set of singular matrices (those with rank smaller than n). For square matrices, 
where N = n, the smallest singular value SniA) provides a quantitative measure of 
invertibility of A. It is natural to ask whether typical matrices are well invertible; 
one often models "typical" matrices as random matrices. This is one of the reasons 
why the smallest singular values of different classes of random matrices have been 
extensively studied (see [17] and the references therein). 

On a deeper level, questions about the behavior of Smin(^) for random A arise 
in several intrinsic problems of random matrix theory. Quantitative estimates of 
•Smin(^) for Square random matrices A with independent entries [151 [181 [161 [19] were 
instrumental in proving the Circular Law, which states that the distribution of the 
eigenvalues of such matrices converges as n — )■ oo to the uniform probability measure 
on the disc [9l[20]. Quantitative estimates on Smin(^) of random Hermitian matrices 
A with independent entries above the diagonal were necessary in the proof of the 
local semicircle law for the limit spectrum of such matrices [1112I]- Stronger bounds 
for the tail distribution of the smallest singular value of a Hermitian random matrix 
were established in [231 IS] , see also [H] . 

1.2. The main results. In the present paper we study the smallest singular value 
for a natural class of random matrices, namely for random unitary and orthogonal 
perturbations of a fixed matrix. Let us consider the complex case first. Let D be 
any fixed nxn complex matrix, and let U he a random matrix uniformly distributed 
over the unitary group U{n) with respect to the Haar measure. Then the matrix 
D + U is non-singular with probability 1, which can be easily observed considering 
its determinant. However, this observation does not give any useful quantitative 
information on the degree of non-singularity. A quantitative estimate of the smallest 
singular value D + U is one of the two main results of this paper. 

Theorem 1.1 (Unitary perturbations). Let D he an arbitrary fixed nxn matrix, 
n>2. Let U be a random matrix uniformly distributed in the unitary group U{n). 
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Then 

IP {Smm(^ + U)<t} < frf, t > 0. 

In the statement above and thereafter C, c denote positive absolute constants. 
As a consequence of Theorem ll.H the random matrix D + U is well invertible, 
\\{D + = n'^(^) with high probability. 

An important point in Theorem 11.21 is that the bound is independent of the 
deterministic matrix D. This feature is essential in the application to the Single 
Ring Theorem, which we shall discuss in Section [1.41 below. 

To see that Theorem 11.21 is a subtle result, note that in general it fails over 
the reals. Indeed, suppose n is odd. If —D,U G SO{n), then —D~^U G SO{n) 
has eigenvalue 1 and as a result D + U = D{In + D^^U) is singular. Therefore, 
if D G 0(n) is any fixed matrix and U G 0{n) is random uniformly distributed, 
Srmn{D + ?7) = with probability at least 1/2. However, it turns out that this 
example is essentially the only obstacle for Theorem 11.11 in the real case. Indeed, 
our second main result states that if D is not close to 0{n), then D + U is well 
invertible with high probability. 

Theorem 1.2 (Orthogonal perturbations). Let D be a fixed n x n real matrix, 
n>2. Assume that 

(1.1) \\D\\<K, inf \\D-V\\>5 

yeO{n) 

for some ii' > 1, 5 G (0, 1). Let U he a random matrix uniformly distributed in the 
orthogonal group 0{n). Then 

^{smin{D + U)<t}<t%Kn/6f, t>0. 

Similarly to the complex case, this bound is uniform over all matrices D satisfying 
(jl.ip . This condition is relatively mild: in the case when K = n^^ and 6 = n"'^^ 
for some constants Ci, C2 > 0, we have 

F {Smin(^ + U)<t] < f'n^, t > 0, 

as in the complex case. It is possible that the condition \\D\\ < K can be eliminated 
from the Theorem ll.2( we have not tried this in order to keep the argument more 
readable, and because such condition already appears in the Single Ring Theorem. 

Motivated by an application to the Single Ring Theorem, we shall prove the 
following more general version of Theorem 11.21 which is valid for complex diagonal 
matrices D. 

Theorem 1.3 (Orthogonal perturbations, full version). Consider a fixed matrix 
D = diag((ii, . . . , dn), n >2, where di G C. Assume that 

(1.2) max|(ij| < J^, max|(i? — d?| > 5 

i i,j 

for some K > 1, 5 & (0, 1). Let U be a random matrix uniformly distributed in the 
orthogonal group 0{n). Then 

^{sminiD + U) <t} <t^{Kn/6f, t>0. 

Let us show how this result implies Theorem 11.21 
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Proof of Theorem from Theorem \1.3[ Without loss of generality, we can assume 
that t < 5/2. Further, using rotation invariance of U we can assume that D = 
diag((ii, . . . , dn) where all di > 0. The assumptions in (jl.ip then imply that 

(1.3) max|(ij| < K, max|dj — 1\ > S. 

i i 

If maxjj \df — d^l > (5^/4 then we can finish the proof by applying Theorem 11.31 
with 5^/4 instead of 6. In the remaining case we have 

max \di — dj\^ < max \d^ — d^A < (5^/4, 

which implies that maxjj \di — dj\ < 5/2. Using (jl.3p . we can choose iq so that 
{dig — 1\ > S. Thus either di^ > 1 + 6 or di^ < 1 — 5 holds. 

If di^ > 1 + 6 then di > di^ — 6/2 > 1 + 6/2 for all i. In this case 

SrniniD + U) > S^UD) - || [/|| > 1 + 5/2 " 1 > t, 

and the conclusion holds trivially with probability 1. 

dif^ < 1 — 6 then similarly di < di^ + 6/2 < 1 — 6/2 for all i. In this case 

SminiD + U)> S,nm(f/) " \\D\\ > 1 - (1 - 6/2) = 6/2 > t, 

and the conclusion follows trivially again. □ 

1.3. A word about the proofs. The proofs of Theorems 11.11 and 11.31 are signifi- 
cantly different from those of corresponding results for random matrices with i.i.d. 
entries \15\ [T6] and for symmetric random matrices [23j . The common starting point 
is the identity Smm(^) = ^^^x&S"~^ ||^2;||2. The critical step of the previous argu- 
ments \15\ [161 123] was the analysis of the small ball probability ¥ {\\Ax\\2 < t} for 
a fixed vector x € 5"""^. The decay of this probability as t — t- is determined by 
the arithmetic structure of the coordinates of the vector x. An elaborate covering 
argument was used to treat the set of the vectors with a "bad" arithmetic struc- 
ture. In contrast to this, arithmetic structure plays no role in Theorems 1 1.11 and II. 31 
The difficulty lies elsewhere - the entries of the matrix D + U are not independent. 
This motivates one to seek a way to introduce some independence into the model. 
The independent variables have to be chosen in such a way that one can tractably 
express the smallest singular value in terms of them. We give an overview of this 
procedure in Section [2] below. 

The proof of Theorem 11.31 is harder than that of Theorem 11.11 To make the 
arguments more transparent, the proofs of the two theorems are organized in such 
a way that they are essentially self-contained and independent of each other. The 
reader is encouraged to start from the proof of Theorem 11.11 

1.4. An application to the Single Ring Theorem. The invertibility problem 
studied in this paper was motivated by a limit law of the random matrix theory, 
namely the Single Ring Theorem. This is a result about the eigenvalues of random 
matrices with prescribed singular values. The problem was studied by Feinberg and 
Zee [6] on the physical level of rigor, and mathematically by Guionnet, Krishna- 
pur, and Zeitouni [10]. Let Dn = diag{d^^\ . . . , d^^) be an n x n diagonal matrix 
with non-negative diagonal. If we choose C/„, Vn to be independent, random and 
uniformly distributed in U{n) or 0{n), then An = UnDnVn constitutes the most 
natural model of a random matrix with prescribed singular values. The matrices Dn 
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can be deterministic or random; in the latter case we assume that Un and Vn are 
independent of Dn- 

The Single Ring Theorem [10] describes the typical behavior of the eigenvalues 
of A ^ as fi — >■ oo. To state this result, we consider the empirical measures of the 
singular values and the eigenvalues of A^. 



where 6x stands for the 5-measure at x, and , . . . , A„ denote the eigenvalues of 

An. Assume that the measures /ii"^ converge weakly in probability to a measure fig 
compactly supported in [0, oo). The Single Ring Theorem [10] states that, under 
certain conditions, the empirical measures of the eigenvalues //i"^ converge in prob- 
ability to an absolutely continuous rotationally symmetric probability measure //e 
on C. Haagerup and Larsen |12j previously computed the density of /ig in terms of 
fig in the context of operator algebras. 

In the formulation of this result, (T„(z) := — denotes the smallest 

singular value of the shifted matrix, and S*^ denotes the Stieltjes transform of a 
Borel measure fi on R: 



Single Ring Theorem. [TH] Assume that the sequence {/is converges weakly 

to a probability measure fis compactly supported on M+. Assume further: 

(SRI) There exists M > such that P {||-Dn|| > M} ^ as n ^ oo; 

(SR2) There exist constants k,ki > such that for any z G C, Im(2;) > n~'^, 



(SR3) There exists a sequence of events Vtn with — )• 1 and constants 6,5' > 
such that for Lebesgue almost any z G C, 



Then the sequence {^e s'^=i converges in probability to a probability measure /ie- 
The measure fie ho,s density, which can be explicitly calculated in terms of the mea- 
sure /ijHI, and whose support coincides with a single ring {z £ C : a < \z\ < b} for 
some < a < b < oo. 

The explicit formula for the density of the measure fie shows that is strictly 
positive in the interior of the ring. This is surprising since the support of the 
measure fig can have gaps. Informally, this means that there are no forbidden zones 
for the eigenvalues, even in the case when there are such zones for singular values. 

The inner and outer radii of the ring can be easily calculated |11] : 



The first two conditions of the Single Ring Theorem are effectively checkable for 
a given sequence d^^\ . . . ,d^n^. Indeed, condition (SRI) is readily reformulated in 

^See [121 Theorem 4.4] and [101 Theorem 1] for a precise description of jie. 





lm{S (n){z))\ < Ki. 



IE [ln„l<x„(2)<n-^ log^CT„(z)] < 6'. 



(1.4) 
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terms of this sequence, since ||-D„|| = si{Dn) = max((i^"\ . . . , dl^^). Condition (SR2) 
is already formulated in terms of this sequence; it means that the singular values of 
the matrices -D„ cannot concentrate on short intervals. 

Since the relation between the singular values of the original and shifted matrices 
is not clear, condition (SR3) is much harder to check. It has only been verified in 
|10j for the original setup of Feinberg and Zee [6], namely for the case when the 
singular values of An are random variables with density 

n / " 

f{dt\ . . . , ~ n 1(4^^^ - idt'^fi' • exp ( - piid'f^f)) ■ n 4 
j<k j=i \j=i 

where P is a polynomial with positive leading coefficient, and where /5 = 1 in the 
real case and /3 = 2 in the complex case. The proof of condition (SR3) for this 
model is based on adding small Gaussian noise to the matrix An and using coupling 
to compare the eigenvalue distributions of random matrices with and without noise. 
Such approach does not seem to be extendable to more general distributions of 
singular values. 

As an application of Theorems 11.11 and 11.31 one can show the following: 

Corollary 1.4. Condition (SR3) can always he eliminated from the Single Ring 
Theorem. 

The remaining conditions (SRI) and (SR2) are formulated in terms of the singular 
values of the original matrix Dn- This means that the validity of the Single Ring 
Theorem for a concrete sequence of matrices Dn can now be effectively checked. 

1.5. Organization of the paper. The rest of this paper is organized as follows. 
In Section [2] we give an overview and heuristics of the proofs of both main results. 
Theorem 11.11 is proved in Section [3l Theorem 11.31 is proved in Section |4] with the 
exception of the low dimensions n = 2, 3 that are treated separately in Appendix [Aj 
some standard tools in the proof of Theorem 11.31 are isolated in Appendix |Bl In 
Section [5] we prove Corollary 11.41 concerning the Single Ring Theorem. 

1.6. Notation. We will use the following notation. 

Positive absolute constant are denoted C, Ci, c, ci, . . .; their values may be differ- 
ent in different instances. The notation a < 6 means a < Cb where C is an absolute 
constant; similarly for a>b. 

The intervals of integers are denoted by [n] := {1, 2, . . . , n} and [k : n] := {k,k + 
1, . . . , n} for k < n. 

Given a matrix A, the minor obtained by removing the first, second and fifth 
rows and columns of A is denoted ^(1,2.5) j similarly for other subsets of rows and 
columns. 

The identity matrix on M" and C" is denoted /„; we often simply write / if the 
ambient dimension is clear. 

Since we will be working with several sources of randomness at the same time, 
we denote by Px,y('^^) the conditional probability of the event £ given all random 
variables except X, Y. 

The operator norm of a matrix A is denoted || and the Hilbert-Schmidt (Frobe- 
nius) norm is denoted ||^||hs- 
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The diagonal matrix with diagonal entries di, . . . ,dn is denoted diag((ii, . . . , 
Finally, without loss of generality we may assume in Theorems 11.11 and 11.31 that 
t < c5 for an arbitrarily small absolute const c > 0. 

Acknowledgement. The authors are grateful for Ofer Zeitouni for drawing their 
attention to this problem, and for many useful discussions and comments. The 
second author learned about the problem at the IMA Workshop oh High Dimensional 
Phenomena in September 2011; he is grateful to IMA for the hospitality. The authors 
are grateful to Anirban Basak who found an inaccuracy in the earlier version of this 
paper, specifically in the application of Theorem ll.2l to the Single Ring Theorem over 
reals. Amir Dembo communicated this to the authors, for which they are thankful. 

2. Strategy of the proofs 

Let us present the heuristics of the proofs of Theorems 11.11 and 11.31 Both proofs 
are based on the idea to use local and global structures of the Lie groups U{n) and 
0{n), but the argument for 0{n) is more difficult. 

2.1. Unitary perturbations. Our proof of Theorems 11.11 uses both global and 
local structures of the Lie group U{n). The local structure is determined by the 
infinitesimally small perturbations of the identity in U{n), which are given by skew- 
Hermitian matrices. This allows us to essentially replace D + U (up to O(e^) error) 
by 

VD + I + eS 

where V is random matrix uniformly distributed in U{n), S is an independent 
skew-Hermitian matrix, and e > is a small number. The distribution of S can 
be arbitrary. For example, one may choose S to have independent normal above- 
diagonal entries. (In the actual proof, we populate just one row and column of S 
by random variables leaving the other entries zero, see ()3.4p .) After conditioning on 
V, we are left with a random matrix with a lot of independent entries - the quality 
that was missing from the original problem. 

However, this local argument is not powerful enough, in particular because real 
skew-Hermitian (i.e. skew-symmetric) matrices themselves are singular in odd di- 
mensions n. This forces us to use some global structure of U{n) as well. A simplest 
random global rotation is a random complex rotation R in one coordinate in C" 
(given by multiplication of that coordinate by a random unit complex number). 
Thus we can essentially replace D + U hy 

A = RVD + I + eS, 

and we again condition on V . A combination of the two sources of randomness, 
a local perturbation S and a global perturbation R, produces enough power to 
conclude that A is typically well invertible, which leads to Theorem ll.il 
The formal proof of Theorem 11.11 is presented in Section [3l 

2.2. Orthogonal perturbations. Our proof of Theorem 11.31 will also make use 
of both global and local structures of the Lie group 0{n). The local structure is 
determined by the skew-symmetric matrices. As before, we can use it to replace 
D + U hy VD + I + eS where V is random matrix uniformly distributed in 0{n) and 
S is a random independent Gaussian skew-symmetric matrix (with i.i.d. iViR(0, 1) 
above-diagonal entries). 
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Regarding the global structure, the simplest random global rotation in 0{n) is a 
random rotation R of some two coordinates in M", say the first two. Still, R alone 
does not appear to be powerful enough, so we supplement it with a further random 
change of basis. Specifically, we replace D with D = QDQ^ where Q is a random 
independent rotation of the first two coordinates. Overall, we have changed D + U 
to 

A = RVD + I + eS, where D = QDQ^. 

Only now do we condition on V, and we will work with three sources of randomness 
- a local perturbation given by S and two global perturbations given by R and Q. 

2.2.1. Decomposition of the problem. By rotation invariance, we can assume that 
D is diagonal, thus D = diag((ii, . . . ,dn). By assumption, df and dj are not close 
to each other for some pair of indices i,j; without loss of generality we can assume 
that di and are not close to each other. Recall that our task is to show that 



(2.1) s^in(^)= inf \\Ax\\2>e 

with high probability. (In this informal presentation, we suppress the dependence 
on n; it should always be polynomial). Each x G S"'~^ has a coordinate whose mag- 
nitude is at least n^^/^. By decomposing the sphere according to which coordinate 
is large, without loss of generality we can replace our task ()2.ip by showing that 

inf \\Ax\\2 > e 

where Si^2 consists of the vectors x G S*""^ with + |x2p > 1/n. 

In order to use the rotations R, Q which act on the first two coordinates, we 
decompose A as follows: 



(2.2) A. 



X ^(1,2) 



where e C^^^ A(i,2) G C^"-^)^^"-^) 



We condition on everything except Q, R and the first two rows and columns of S. 
This fixes the minor ^(1,2) • We will proceed differently depending on whether ^(1,2) 
is well invertible or not. 

2.2.2. When the minor is well invertible. Let us assume that 

||M|| < i where M := (^2))"^- 
It is not difficult to show (see Lemma l4.4p that in this case 

inf \\Ax\\2>e-Smin{Ao-YMX). 

So our task becomes to prove that 

(Ao - YMX) > 1. 

We have reduced our problem to invertibility of a 2 x 2 random matrix. 

The argument in this case will only rely on the global perturbations Q and R and 
will not use the local perturbation S. So let us assume for simplicity that S = 0, 
although removing S will take some effort in the formal argument. Expressing the 
matrix ^0 ~ YMX as a function of R, we see that 

^0 - YMX = I + RqB 
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where B G C^^^ and Rq G 0(2) is the part of R restricted to the first two coordinates 
(recall that R is identity on the other coordinates). 

Note that / + RqB has the same distribution as Rq^ + B and Rq is uniformly 
distributed in 0(2). But invertibility of the latter matrix is the same problem as we 
are studying in this paper, only in dimension two. One can prove Theorem 11.31 in 
dimension two (and even for non-diagonal matrices) by a separate argument based 
on Remez-type inequalities; see Appendix |3 It yields that unless B is approxi- 
mately complex orthogonal, i.e. II-BS"*" — /|| ^ II^IP) the random matrix / + RqB 
is well invertible with high probability in Rq, leading to the desired conclusion. We 
have thus reduced the problem to showing that B is not approximately complex 
orthogonal. 

To this end we use the remaining source of randomness, the random rotation Q. 
Expressing 6 as a function of Q, we see that 

B = TDo 

where T € C^^^ is a fixed matrix, Dq = QoDqQq, and Qo, Dq are the 2x2 minors of 
Q and D respectively. Thus Qo is a random rotation in SO{2) and Dq = diag(di, ^2). 

Now we recall our assumption that df and d!^ are not close to each other. It 
is fairly easy to show for such Dq that, whatever the matrix T is, the random 
matrix B = TDq = TQqDqQq is not approximately complex orthogonal with high 
probability in Qq (see Lemma 14.61) . This concludes the argument in this case. The 
formal analysis is presented in Section [4.31 

2.2.3. When the minor is poorly invertible. The remaining case is when 

||M|| > ^, where M := (^(^2))""^- 

We will only use the local perturbation S in this case. 

Here we encounter a new problem. Imagine for a moment that the were working 
with decompositions into dimensions 1 + (n — 1) rather than 2 + (n — 2), thus in 
([Z2]) we had Aq £ C^^\ ^{1,2) ^ C^""^)^^'""^). Using the Gaussian random vector 
X, one could quickly show (see Lemma l4.8p that in this case 

(2.3) inf ||Ix||2 > e 

xeSi 

with high probability, where 5*1 consists of the vectors x G S*""^ with |xi| > 

Unfortunately, this kind of argument fails for decompositions into dimensions 
2 + (n — 2) which we are working with. In other words, we can step one rather than 
two dimensions up - from a poor invertibility of an (n — 1) x (n — l) minor to the good 
invertibility of the nxn matrix (on vectors with the large corresponding coordinate) . 
The failure of stepping two dimensions up has a good reason. Indeed, one can show 
that Gaussian skew symmetric matrices are well invertible in even dimensions n and 
singular in odd dimensions n. Since our argument in the current case only relies on 
the local perturbation given by a Gaussian skew symmetric matrix, nothing seems 
to prevent both the (n — 2) x (n — 2) minor and the full nxn matrix to be poorly 
invertible if n is odd. 

To circumvent this difficulty, we shall redefine the two cases that we have worked 
with, as follows. 
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Case 1: There exists an (n — 3) x (n — 3) minor ^(1^2, «) of ^(1,2) which is weh 
invertible. In this case one proceeds by the same argument as in Section 12.2.21 but 
for the decomposition into dimensions 3 + (n — 3) rather than 2 + (n — 2) . The formal 
argument is presented in Section 14.31 

Case 2: All (n — 3) x (n — 3) minors ^(i,2,i) of ^(1,2) ^^e poorly invertible. Let 
us fix i and apply the reasoning described above, which allows us one to move one 
dimension up, this time from n — 3 to n — 2. We conclude that ^(1,2) is well invertible 
on the vectors whose i-th coordinate is large. Doing this for each i and recalling 
that each vector has at least one large coordinate, we conclude that ^(1,2) is well 
invertible on all vectors. Now we are in the same situation that we have already 
analyzed in Section r2.2.2l as the minor ^(1,2) is well invertible. So we proceed by the 
same argument as there. The formal analysis of this case is presented in Section [4.41 

Summarizing, in Case 1 we move three dimensions up, from n — 3 to n, in one step. 
In Case 2 we make two steps, first moving one dimension up (from poor invertibility 
in dimension n — 3 to good invertibility in dimension n — 2), then two dimensions 
up (from good invertibility in dimension n — 2 to good invertibility in dimension n). 

This concludes the informal presentation of the proof of Theorem 11.31 

3. Unitary perturbations: proof of Theorem 11.11 
In this section we give a formal proof of Theorem 11.11 

3.1. Decomposition of the problem; local and global perturbations. 

3.1.1. Decomposition of the sphere. By definition, we have 

■Smin{D + U)= inf \\{D + U)x\\2. 

Since for every x G 5"""^ there exists a coordinate i E [n] such that \xi\ > l/-v/n, a 
union bound yields 

(3.1) ¥{smin{D + U) <t}< j inf \\{D + U)x\\2 < t] 

1=1 

where 

Si = {xe S""-^ : \xi\ > 1/^} . 
So, without loss of generality, our goal is to bound 

(3.2) p| inf \\{D + U)x\\2 <t\ . 

{x&Si J 

3.1.2. Introducing local and global perturbations. We can express U in distribution 
as 

(3.3) U = V-'^R-^W 

where V,R,W£ U (n) are random independent matrices, such that V is uniformly 
distributed in U{n) while R and W may have arbitrary distributions. In a moment, 
we shall choose i? as a random diagonal matrix (a "global perturbation"), W as 
a small perturbation of identity with a random skew-Hermitian matrix (a "local 
perturbation"), and we shall then condition on V. 
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So we let V be uniform in U (n) and let 

i? = diag(r,l,...,l), 

where r is a random variable uniformly distributed on the unit torus T C C. Finally, 
W will be defined with the help of the following standard lemma. It expresses 
quantitatively the local structure of the unitary group U{n), namely that the tangent 
space to U (n) at the identity matrix is given by the skew-Hermitian matrices. 

Lemma 3.1 (Perturbations of identity in U (n)). Let S be an nx n skew-Hermitian 
matrix (i.e. S* = —S), let e > and define 

Wo = I + eS. 

Then there exists W & U (n) which depends only on Wq and such that 

\\W - WoW < 2e^\\S^\\ whenever e^\\S^\\ < 1/4. 

Proof. We write the singular value decomposition Wq = UqTiVo where Uq, Vq £ U (n) 
and S is diagonal with non-negative entries, and we define W := UoVq. Since S is 
skew-Hermitian, we see that 

W^Wo = {! + eS)*{I + eS) = I- e^S^, 

so TVg Wo — I = £^5'^. On the other hand, the singular value decomposition of Wq 
yields VFqT^o - I = ^o*(^^ - IWo- Combining these we obtain 

<e^\\S^\\. 

Assuming e^HS^H < 1/4 and recalling that S is a diagonal matrix with non- negative 
entries we conclude that 

||S - /|| < 2e^\\S^\\. 

It follows that 

11^ - WoW = \\Uo{I - ^)Vo\\ = \\I - S|| < 2e^\\S^\\, 
as claimed. □ 

Now we define the random skew-Hermitian matrix as 

^s -Z^^ 



(3.4) S 







where s ~ iV]R(0, 1) and Z ^ A'ir(0, are independent standard normal random 
variable and vector respectively. Clearly, S is skew-Hermitian. 

Let e G (0, 1) be an arbitrary small number. We define Wq and W as in 
Lemma 13.11 and finally we recall that a random uniform U is represented as in 



3.1.3. Replacing D+U by RVD+I+eS. Let us rewrite the quantity to be estimated 
()3.2p in terms of the global and local perturbations. Applying Lemma 13.11 for the 
random matrix Wq = I + £S, we obtain a random matrix W € U{n), which satisfies 
the following for every x £ Si: 

\\{D + U)x\\2 = \\iD + V~^R~^W)x\\2 = \\{RVD + W)x\\2 

> WiRVD + Wo)x\\2 - \\W - WoW 

> WiRVD + I + eS)xW2 - 2e^WS'^\\ whenever e^y 52 y < ;l/4_ 
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Further, E\\Sf < + 2||Z||| = 2n - 1, so ||S|| = 0{y/n) with high probabihty. 
More precisely, let Kq > 1 be a parameter to be chosen later, and which satisfies 

(3.5) e^K^n < 1/4. 
Consider the event 

(3.6) £s = {\\S\\ < Ko^}; then P(£:^) < 2exp{-cK^n) 

by a standard large deviation inequality (see e.g. [22l Corollary 5.17]). On £s, one 
has e^ll^^ll < 1/4 due to ([33]) . and thus 

\\{D + U)x\\2 > WiRVD + 1 + eS)x\\2 - 2e^Kln. 

Denote 

A := RVD + I + eS. 
let n G (0, 1) be a parameter to be chosen later, and which satisfies 

(3.7) > 2eK^n. 
By the above, our goal is to estimate 

inf \\{D + U)x\\2 <fie\<F\mi \\Ax\\2 < fie + 2eK^n A £s \ +^{£3) 



(3.8) < 



<^ inf \\Ax\\2 < 2fj,e A£s } +2exp{-cK^n) 



VD 



A = RVD + I + eS 



where u,v £ 



Summarizing, we now have a control of the first coordinate of x, we have introduced 
the global perturbation R and the local perturbation 5, and we replaced the random 
matrix D + U hy A = RVD + 1 + eS. 

3.1.4. Decomposition into 1 + (n — 1) dimensions. Next, we would like to expose 
the first row and first column of the matrix A = RVD + I + eS. We do so first for 
the matrix 

'{VD)n v'^ 

(^^){i,i)J 

Recalling the definition (j3.4p of S, we can express 
(3.9) 

r{VD)u + l + ^/^es {rv - eZf 
u + eZ (/ + yD)(i,i)_ 

We condition on an arbitrary realization of the random matrix V . This fixes the 
number (yZ))ii, the vectors u, w and the matrix involved in (j3.9p . All random- 
ness thus remains in the independent random variables r (which is chosen uniformly 
in T), s ~ A']r(0, 1) and the independent random vector Z ~ N^(Q^ In-i)- We regard 
the random variable r as a global perturbation, and s, Z as local perturbations. 



-1 






"^11 Y^' 




X B'^ 



3.2. Invertibility via quadratic forms. Recall from (13. Sh that our goal is to 
bound below the quantity 

inf \\Ax\\2. 

Let Ai, . . . , An denote the columns of A. Let h E C" be such that 

|2 = 1, h^Ai = 0, i = 2,...,n. 
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For every x E C" we have 

n n 

\\Ax\\2 = \^XiA^ yW^XiAi = \xi\-\h^ Ai\. 



1=1 



i=l 



Since > l/\/n for all vectors x G Si, this yields 
(3.10) inf \\Ax\\2 > \h^Ai\. 



n 



We thus reduced the problem to finding a lower bound on Let us express 

this quantity as a function of X and Y in the decomposition as in (j3.9p . The 
following lemma shows that is essentially a quadratic form in X^Y , which is 

ultimately a quadratic form in Z. 



Lemma 3.2 (Quadratic form). Consider an arbitrary square matrix 
X B'^ 



A 



Assume that B is invertible. Let Ai, . . . , An denote the columns of A. Let h € C"" 
be such that 

\\h\\2 = l, h'^Ai = 0, i = 2,. 

Then 

\An -X'^B-^Y\ 



,n. 



\h^Ai\ 



^l + \\B-^Y\\i ■ 



Proof. The argument is from j23] Proposition 5.1]. We express h by exposing its 
first coordinate as 

r/iii 



h 



Then 
(3.11) 



h^Ai = {hi TJ) 



Au 
X 



h 



Auhi + h^X = Aiihi + X^h. 



The assumption that h'^Ai = for i > 2 can be stated as 



hiY'^ + h'^B'^. 



-hi ■ B-^Y. 



= {hi h') 

Equivalently, hiY + Bh = Hence 

(3.12) h = 
To determine hi, we use the assumption \\h\\2 = 1 which implies 

1 = |/ii|2 + ||/i||2 = |/i^|2 _^ |/j^|2 . \\B-^Y\\l. 

So 

1 

(3.13) \hi 



^1 + ||B-iy| 



Combining ()3.11|) and ()3.12p . we obtain 



\h'^Ai\ = Aiihi - hi ■ X'^B-^Y. 



This and (j3.13p complete the proof. 



□ 
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Let us use this lemma for our random matrix A in (|3.9p . One can check that the 
minor B is invertible almost surely. To facilitate the notation, denote 

M := B-^. 

Recall that M is a fixed matrix. Then 



^l + \\MYf^ 
Since as we know from (13.91). 



Aii = r{VD)ii + l + yf-ies, X = u + £Z, Y = rv - eZ, 

we can expand 
(3.14) 

,,T . , \r{VD)n + 1 + ^/^£s - ru^Mv - £r{MvY Z + eu^MZ + e^Z'^MZ] 

^^l + \\rMv - eMZf^ 

Recall that {VD)\i^ u, v, M are fixed, while r, s, Z are random. 

Our difficulty in controlling this ratio is that the typical magnitudes of ||M|| and 
of ||Mt;||2 are unknown to us. So we shall consider all possible cases depending on 
these magnitudes. 

3.3. When the denominator is small. We start with the case where the denomi- 
nator in ()3.14p is 0(1). The argument in this case will rely on the local perturbation 
given by s. 

Let K > 1 he a parameter to be chosen later, and let us consider the event 

Sdcnom = {\\rMv - eMZh < K} . 

This event depends on random variables r and Z and is independent of s. Let us 
condition on realizations of r and Z which satisfy ^denom- We can rewrite (|3.14p as 

,,T. , Ira + \/—les\ \ra + \/— 1 es\ 
\h Ai\>- ' > ' ' 



where a G C and r G T (and of course K) are fixed numbers and s ~ -A']r(0, 1). Since 
the density of s is bounded by 1 /\/27r, it follows that 

^sS^\h^Ai\<^^ <CX, A>0. 

Therefore, a similar bound holds for the unconditional probability: 

ip||/i"^^l| < ^ and^Tdcnom} 

Finally, using (j3.1Up this yields 



< CA, A > 0. 



(3.15) P I inf WAxh < -J^ and ^^dcnoml <CX, A > 0. 

This is a desired form of the invertibility estimate, which is useful when the event 
"^dcnom holds. Next we will analyze the case where it does not. 
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3.4. When the denominator is large and ||M|| is small. If <Sdonom does not 
occur, either ||Mw||2 or \\MZ\\2 must be large. Furthermore, since M is fixed and 
Z ~ A'i8(0, /n-i)i we have ||MZ||2 ~ ||M||hs with high probability. We shall consider 
the cases where ||M||hs is small and large separately. In this section we analyze the 
case where ||M||hs is small. The argument will rely on a the global perturbation r 
and the local perturbation Z. 

To formalize this, assume that ifdonom does not occur. Then we can estimate the 
denominator in ()3.14p as 



'1 + \\rMv - eMZ\\l < 2\\rMv - eMZ\\2 < 2\\Mv\\2 + e\\MZ\\2. 
Note that E ||MZ||| = ||M||yg. This prompts us to consider the event 

Saiz :={||MZ||2 <i^i||M||Hs}, 

where Ki > 1 is a parameter to be chosen later. This event is likely. Indeed, the 
map f{Z) = \\MZ\\2 defined on M'"~^ has Lipschitz norm bounded by ||M||, so a 
concentration inequality in the Gauss space (see e.g. \13\ (1.5)]) implies that 

F{£mz) > 1 - exp ( - ^^pl^) > 1 - eM-cK!). 
On the event £mz H £^cnom 



(3.16) ^Jl + \\rMv - £MZ\\j < 2||M?;||2 + e/Ci ||M||hs. 

Now we consider the case where ||M||hs is small. This can be formalized by the 
event 

K 



(3.17) <?M := |||M||hs < 2^^^ 

On the event £mz H "fdcnom the inequality p.l6p yields the following bound 

on the denominator in (|3.14p : 

1 + \\rMv - eMZ\\l < 2||M?;||2 + — . 
On the other hand, the left side of this inequality is at least K by £^^j^q^- Therefore 



(3.18) ^1 + \\rMv - eMZ\\l < 4||M7;||2. 

To estimate the numerator in (|3.14p , let us condition for a moment on all random 
variables but r. The numerator then takes the form \ar + h\ where a = (yD)ii — 
viJ Mv - e{MvY Z and b = 1 + y/^es + eu^ MZ + £^Z'^ MZ are fixed numbers and 
r is uniformly distributed in T. A quick calculation yields a general bound on the 
conditional probability: 

Pr {|ar + 6| > Ai|a|} > 1 - CAi, Ai > 0. 

Therefore a similar bound holds unconditionally. Let Ai G (0, 1) be a parameter to 
be chosen later. We showed that the event 

■^num := {numerator in (fmi) > \i\{VD)ii - Mv - e{Mv)^ Z\^ 

is likely: 

IP(^num) > 1 - CAi. 
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Assume that the event Snumf^SMzf^S^^^^^nSM occurs. (Here the first two events 
are Hkely, while the other two specify the case being considered in this section.) We 
substitute the bounds on the denominator (j3.18p and the numerator (given by the 
definition of <fnum) into (|3.14p to obtain 

I > Xi\{VD)u - Mv - ejMvy Z\ 
~ 4||Mf||2 
We can rewrite this inequality as 

|,T,|^|,^Aie J , {VD)n - u'Mv Mv 
\n All — " "I ■ ^ b where a = Ai • , w :— 



4 ' 4||Mu||2 ll^^^lb 

Here d is a fixed number and w is a fixed unit vector, while Z ~ A'k(0, 
Therefore w"^ Z = 9^, where 7 ~ A'k(0, 1), and 9 £ C, \9\ = 1. A quick density 
calculation yields the following bound on the conditional probability 

¥zS^\d+^ ■w'^Z\< XXie^ <CX, A > 0. 

Hence a similar bound holds unconditionally: 

P { I h^Ai I < AAie and £,,^^ D £mz n n£M}<CX, A > 0. 

Therefore, 

P {{h'Ail < AAie and ^T^,,^ n £m} < H^LJ + PiSliz) + CX 

< CXi + exp{-cKf) + CX, A > 0. 

Using (|3.10p . we conclude that 
(3.19) 

P I inf \\Ax\\2 < and f^enom ^£m \ < CXi + exp{-cKf) + CX, A > 0. 

3.5. When ||M|| is large. The remaining case to analyze is where ||M|| is large, i.e. 
where £m does not occur. Here shall estimate the desired quantity infa^gSj ||^2;||2 
directly, without using Lemma 13.21 The local perturbation Z will do the job. 



Indeed, on £'j^^ we have 



1 „„ i„ 1 K 



B'H > — I|S-1||hs = ^I|M||hs > 



/n" ^/n" 2eKiy/n 

Therefore there exists a vector w G C""^ such that 

(3.20) ll«^l|2 = l, \\Bw\\2< 

K 

Note that w can be chosen depending only on B and thus is fixed. 
Let X £ Si he arbitrary; we can express it as 

1 



(3.21) X 

Set 



Xl 
X 



where |xi| > 



w 
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Using the decomposition of A given in (j3.9|) . we obtain 

r^ii 

X B'^ 



\Ax\\2 > \w'^Ax\ 



[O w'^] 



= \xi ■ vb^X + w'^B^x\ 

> \xi\ ■ \w^X\ — ||i?it;||2 (by the triangle inequahty) 

> ^ Iw'^Xl - '^^^^y^ (using (Km and IKM). 
vn A 



Recahing from p.9p that X = u + eZ and taking the infimum over x ^ Si, we obtain 

n „ . „ 1 , T T^i 'ieKiJE 

mf L4rc 2 > u + Z — . 

xeSi Jn K 



Recall that w, u are fixed vectors, \\w\\2 = 1, and Z ~ N^{0, In-i)- Then w'^ Z = 6^, 
where 7 ~ A'k(0, 1), and 9 £ C, \6\ = 1. A quick density calculation yields the 
following bound on the conditional probability: 



Fz{\w'^u + ew'^Z\<eX^ <CX, A > 0. 



Therefore, a similar bound holds unconditionally, after intersection with the event 
<5|,j. So we conclude that 

(3.22) P I inf ||Ax||2 < ^ - ^£^1^ and Sir] <CX, A > 0. 
1x651 ^/n K I 



3.6. Combining the three cases. We have obtained lower bounds on infa^gs^ ||^2;||2 
separately in each possible case: 

• inequality (|3.15p in the case of small denominator (event ^'denom)) 

• inequality (j3.19p in the case of large denominator, small ||M|| (event £^^,^0^^^^ 

£m)', 

• inequality (I3.22P in the case of large ||M|| (event 
To combine these three inequalities, we set 

1 / A AAi A 2KiJE 
fi := — mill 



2 \K^/li' V^' K 



We conclude that if the condition (13. Sp on Kq and the condition ()3.7p on are 
satisfied, then 

inf Px||2 < 2/ie I < CA + (CAi + (iicp{-cKl) + CA) + CA 
xeSi I 



Substituting into (|3.8p . we obtain 



3CA + CAi + e^v{-cKl) 



inf \\{D + U)x\\2 <fie\< 3CA + CAi + exp(-civ:f ) + 2 exp(-cK^n). 

xGS'i J 

The same holds for each Si, i £ [n]. Substituting into ()3.ip . we get 

IP {smin(-D + f/) < /ue} < 3CAn + CXin + exp{-cKf)n + 2exp{-cKQn)n. 
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This estimate holds for all A, Ai,e G (0, 1) and all K,Kq,Ki > 1 provided that the 
conditions (j3.5|) on Kq and (j3.7p on /x are satisfied. So for a given e S (0, 1), let us 
choose 

X = X,=e^-\ Ko = Ki = log{l/e), K = ^ = Alog{l/e)ne-^-\ 

Then 

AAi £0-3 



41og(l/e)n3/2' 

Assume that e < c'n^^ for a sufficiently small absolute constant c' > 0; then one 
quickly checks that the conditions (j3.5p and ()3.7p are satisfied. For any such e and 
for the choice of parameters made above, our conclusion becomes 

IP <^ s^i^iD + U)< .,0 \ ^ e°-'n+exp(-clog2(l/e))+exp(-clog2(l/e)n)n. 

[ 41og(l/e)n'^/^ J 

Since this estimate is valid for all e < c'n~'^, this quickly leads to the conclusion of 
Theorem II. 1[ □ 

4. Orthogonal perturbations: proof of Theorem 11.31 
In this section we give a formal proof of Theorem 11.31 

4.1. Initial reductions of the problem. 

4.1.1. Eliminating dimensions n = 2,3. Since our argument will make use of (n — 
3) X (n — 3) minors, we would like to assume that n > 3 from now on. This calls for 
a separate argument in dimensions n = 2, 3. The following is a somewhat stronger 
version Theorem 11.31 in these dimensions. 

Theorem 4.1 (Orthogonal perturbations in low dimensions). Let B be a fixed nxn 
complex matrix, where n £ {2,3}. Assume that 

(4.1) \\BB'^ - I\\>6\\Bf 

for some 6 G (0, 1). Let U be a random matrix uniformly distributed in 0{n). Then 

^{Smin{B + U)<t}<Cit/5y, t>0. 

We defer the proof of Theorem 14.11 to Appendix Rl 

Theorem 14.11 readily implies Theorem 11.31 in dimensions n = 2,3. Indeed, the 
assumptions in (II. 2p yield 



\\DD'^ - III = max \df - 1| > - max \df - d^A > - > \\Df. 

i 2 i,j ■' 2 2il^ 

Therefore we can apply Theorem 14.11 with 5/2K'^ instead of 5, and obtain 

IP {smin(^ + U)<t}< C{2Kh/6y, t > 0. 

Thus we conclude a slightly stronger version of Theorem 11.31 in dimensions n = 2, 3. 
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Remark 4.2 (Complex orthogonality). The factor \\B\\'^ can not be removed from 
the assumption (j4.ip . Indeed, it can happen that — -^|| = 1 while B + U is 

arbitrarily poorly invertible. Such an example is given by the matrix = M - [ } Jj^] 
where M ^ oo. Then det{B + [/) = ! for all U G 0(2); \\B + U\\ ~ M, thus 
Smm{B + [/) < 1/M ^ 0. On the other hand, BB'^ = 0. 

This example shows that, surprisingly, staying away from the set of complex 
orthogonal matrices at (any) constant distance may not guarantee good invertibility 
oi B + U. It is worthwhile to note that this difficulty does not arise for real matrices 
B. For such matrices one can show that factor can be removed from (j4.ip . 

Remark 4.3. Theorem 14.11 will be used not only to eliminate the low dimensions 
n = 2 and n = 3 in the beginning of the argument. We will use it one more time in 
the heart of the proof, in Subsection 14.3.61 where the problem in higher dimensions 
n will get reduced to invertibility of certain matrices in dimensions n = 2, 3. 

4.2. Local perturbations and decomposition of the problem. We can repre- 
sent U in Theorem 11.31 as U = V~^W where V,W ^ 0{n) are random independent 
matrices, V is uniformly distributed in 0{n) while W may have arbitrary distribu- 
tion. We are going to define as a small random perturbation of identity. 

4.2.1. The local perturbation S. Let S be an independent random Gaussian skew- 
symmetric matrix; thus the above-diagonal entries of S are i.i.d. A'k(0, 1) random 
variables and S'^ = —S. By Lemma \3A] Wq = I + eS is approximately orthogonal 
up to error O(e^). Although this lemma was stated over the complex numbers it 
is evident from the proof that the same result holds over the reals as well (skew- 
Hermitian is replaced by skew-symmetric, and U{n) by 0(n)). 

More formally, fix an arbitrary number e G (0,1). Applying the real analog 
of Lemma 13.11 for the random matrix Wq = I + eS, we obtain a random matrix 
W E 0{n) that satisfies 

s^UD + U) = s^iniD + V^^W) = s^iniVD + W) 

> Smin{VD + Wo)-\\W-Wo\\ 

> s^iniVD + Wo) - 2e^\\S^\\ whenever e^\\S^\\ < 1/4. 

Further, \\S\\ = 0{^/n) with high probability. Indeed, let Kq > 1 be a parameter 
to be chosen later, which satisfies 

(4.2) £'^K^n<l/4. 
Consider the event 

(4.3) £s ■■= {\\S\\ < KoV^}; then ¥{£^) < 2exp(-cK^n) 
provided that 

(4.4) Ko > Co 

for an appropriately large constant Co- Indeed, by rotation invariance S has the same 
distribution as {§ — S'^)/\/2 where S is the matrix with all independent N^{0, 1) 
entries. But for the matrix S, a version of ()4.3p is a standard result on random 
matrices with iid entries, see |22l Theorem 5.39]. Thus by triangle inequality, (14. 3p 
holds also for S. 
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On £s, one has e'^\\S'^\\ < 1/4 due to (jOj) . and thus 

Smin(^ + U)> s^UVD + Wo) - 2e^Kln. 
Next, let IX G (0, 1) be a parameter to be chosen later, and which satisfies 

(4.5) ^ > 2eKln. 
Our ultimate goal will be to estimate 

By the above, we have 

P < F [srmniVD + Wq) < fiE + 2e^ a Es] + n^s) 
< IP {sminiVD + Wo) < 2fie A £s} + n^s) 

(4.6) < P {smin(^) < A £s} + 2exp{-cK^n), 
where 

A:=VD + I + eS. 

Summarizing, we have introduced a local perturbation S, which we can assume to 
be well bounded due to £s- Moreover, S is independent from V, which is uniformly 
distributed in 0(n). 

4.2.2. Decomposition of the problem. We are trying to bound below 

Smin(^) = inf ||^a;||2. 

Our immediate task is to reduce the set of vectors x in the infimum to those with 
considerable energy in the first two coordinates, + |x2p > 1/n. This this allow 
us to introduce global perturbations R and Q, which will be rotations of the first 
few (two or three) coordinates. 

To this end, note that for every x G 5""^ there exists a coordinate i G [n] such 
that \xi\ > n~^/'^ . Therefore 

(4.7) 5"-^ = U ^^^^^ '■= ^ ^"'^ • l^^l - ^"^^^}- 

More generally, given a subset of indices J G [n], we shall work with the set of 
vectors with considerable energy on J: 



Sj := |x G 5""^ : J^xf > 



i6J 

Note that the sets Sj increase by inclusion: 

Ji ^ J2 implies Sj-^ C Sj^. 

To simplify the notation, we shall write 5'i^2,3 instead of Sji 2,3}, etc. 

Using (|4.7p . we decompose the event we are trying to estimate as follows: 



{smin(^) < 2/ze} = U { mf px||2 < 2/ie|. 



Next, for every i G [n] 



max \df — dj\^ > - max \df — dj\'^, 

je[n] 2 i,je[n] 
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SO the second assumption in (jl.3|) implies that there exists j = £ [n], j ^ i, 
such that \df — d'^^-^\ > 6. 

Since Si C we obtain from the above and (|4.6p that 



n . 



inf \\Ax\\2 < 2n£A£s } +2exp{-cK^n) 



(4.8) =: ^p, + 2exp(-ciC2 



2n) 



We reduced the problem to estimating each term pi. This task is similar for each 
i, so without loss of generality we can focus on i = 1. Furthermore, without loss of 
generality we can assume that j(l) = 2. Thus our goal is to estimate 



Pi=F{ inf Px||2 < 2fieA£s 
under the assumption that 

(4.9) \dl-dl\>6. 

Finally, we further decompose the problem according to whether there exists a 
well invertible (n — 3) x (n — 3) minor of ^(1,2) or not. Why we need to consider 
these cases was explained informally in Section [2.2.31 

Let Ki > 1 be a parameter to be chosen later. By a union bound, we have 

Pi<Vp| inf \\Ax\\2<2^,eA\\iA(^^^2,^)r^\\<—^Ss] 

+ p| inf Px||2 < 2/ie A IK^n 2,i))"i > — Vi G [3 : n] A^:^! 

n 

(4.10) =: X]^'M+^'l,0• 



^=3 



4.3. When a minor is well invertible: going 3 dimensions up. In this section 
we shall estimate the probabilities pi^j, i = 3, . . . ,n, in the decomposition (j4.10p . 
All of them are similar, so without loss of generality we can focus on estimating ^1,3. 
Since 51^2 ^ 'S'1^2,3) we have 

(4.11) m,3<ip| inf Px||2 <2/xeA||(An,2,3))"'|| < — A^Tsj. 

This is the same as the original invertibility problem, except now we have three 
extra pieces of information: (a) the minor ^(1^2,3) is well invertible; (b) the vectors 
in 5'i^2,3 over which we are proving invertibility have large energy in the first three 
coordinates; (c) the local perturbation S is well bounded. 
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'Qo 


0' 


, R:= 


Ro 


0" 





/ 





/ 



4.3.1. The global perturbations Q, R. The core argument in this case wih rely on 
global perturbations (rather than the local perturbation S), which we shall now 
introduce into the matrix A = VD + / + eS. Define 

Q:= 

where Qo G -50(3) and Rq G 0(3) are independent uniform random matrices, and 
/ denotes the identity on C^^'"!. 

Let us condition on Q and R for a moment. By the rotation invariance of the ran- 
dom orthogonal matrix V and of the Gaussian skew-symmetric matrix S, the (condi- 
tional) joint distribution of the pair iV, S) is the same as that of {Q^ RVQ, SQ). 
Therefore, the conditional distribution of A is the same as that of 

Q^RVQD + 1 + eQ^SQ = Q^{RVQDQ^ + 1 + eS)Q =: A. 

Let us go back to estimating pi^3 in (j4.1ip . Since A and A are identically distributed, 
and the event £s does not change when S is replaced by Q^SQ, the conditional 
probability 

Px||2< 2^6 A 11(^(1,2,3)] 



inf 



<--^A£s 

e 



Q,R 



does not change when A is replaced by A. Taking expectations with respect to Q 
and R we see that the full (unconditional) probability does not change either, so 



(4.12) 



Pl,3 



< 



inf Px||2 < 2/xe A ||(A(i,2.3): 



< ASs 

£ 



4.3.2. Randomizing D. Let us try to understand the terms appearing in A. We 
think of 

b := QDQ^ 

as a randomized version of D obtained by a random change of basis in the first three 
coordinates. Then we can express 

A = Q^AQ where A = RVD + I + eS. 

Compared to A = VD + / + eS, the random matrix A incorporates the global 
perturbations Q and R. Thus we seek to replace A with A in our problem. To this 
end, let us simplify two quantities that appear in ()4.12p . 
First, 



(4.13) 



inf ||^2;||2 
3;e5i,2,3 



inf \\Ax\\2 
2-'eoi,2,3 



since Q(S'i,2,3) = 5'i,2,3 by definition and G SO{n). Second, using that Q and R 
affect only the first three coordinates and since D is diagonal, one checks that 



(4.14) yl(i,2,3) = ^(1,2,3) ={VD + I + e5)(i,2,3) 

Similarly to previous matrices, we decompose S as 
r, \Sq —Z^ 



(yz) + I + eS) 



(1,2,3)- 



(4.15) 



S 



where 5o G 



p3x3 



S G 



(n-3)x(n-3) 



Note that 5o, S and Z are independent, and that Z G 
with all i.i.d. A']r(0, 1) entries. 



(n 3)x3 jg random matrix 
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By (|4.14p . ^(1^2,3) is independent of Sq, Z, Q, R; it only depends on V and S. 
Let us condition on 5*0, S and V, and thus fix 2,3) such that the invertibility 
condition in (|4.12|) is satisfied, i.e. such that 

ii(Ai,2,3)r'ii<^ 

(otherwise the corresponding conditional probability is automatically zero). All 
randomness remains in the local perturbation Z and the global perturbations Q, R. 

Let us summarize our findings. Recalling (|4.13p . we have shown that 



(4.16) 
where 



So,S,V 



pi,3< inf Fz,Q,R< inf \\Ax\\2 < 2fie A £3 



A = RVD + I + eS, D:= QDQ^ , 
where S is decomposed as in ()4.15p . and where the infimum is over all S, V satisfying 



(4.17) 



\\{VD + I + eS\i^2,. 



3); 



< 



Compared with (j4.1ip . we have achieved the following: we introduced into the prob- 
lem global perturbations Q, R acting on the first three coordinates. Q randomizes 
the matrix D and R serves as a further global rotation. 

4.3.3. Reducing to invertibility of random 3x3 matrices. Let us decompose the 
matrix A = RVD + I + eS hy revealing its first three rows and columns as before. 
To this end, recall that R= [^^ j] and Q = ['^^ j] • We similarly decompose 



V 



Vo v' 
u V 



and 
(4.18) 



D 



Do 0' 
D 



then D 



Do 
D 



where Do := QoDQJ- 



Using these and the decomposition of S in (I4.15p . we decompose 

'RoVqDo + I0 + eSo RovD - eZ^ 
uDo + eZ Vb + I + eS 



(4.19) A = RVD + I + eS 





'Ho 


Y' 




X 


H 



where Iq denotes the identity in C^. 
Note that 

H = VD + I + eS = {VD + 1 + eS)^i^2,3) 
is a well invertible matrix by (|4.17p . namely 



(4.20) 



\H 



< 



The next lemma reduces invertibility of A to invertibility of a 3 x 3 matrix Hq 
YH-^X. 



24 MARK RUDELSON AND ROMAN VERSHYNIN 

Lemma 4.4 (Invertibility of a matrix with a well invertible minor). Consider a 
matrix 



H 

Assume that 



X H 



where Hq G C3^^ H G c^"-^)^^"-^). 



\H'^\\<Li, ||y|| < L2 



for some Li,L2 > 0. Then 



inf \\Hx\\2 > ^.^l^ ^ , s^i„(/?o - YH-^X). 
Proof. Choose x G 5'i,2,3 which attains inf^^ggj 23 ||i7x||2 =: 6 and decompose it as 

where G C^, x G C""^. 



X 



Xq 
X 



Then 

Xxo + Hx 

The assumption ||ffx||2 = 5 then leads to the system of inequalities 

{\\HqXq + Yx\\2 < 5 

\\\Xxo + Hx\\2<S 

We solve these inequalities in a standard way. Multiplying the second inequality by 
we obtain 

\\H-^Xxo + x\\2 < S\\H-^\\ < 5Li, 

which informally means that x w —H~^Xxq. Replacing x with —H~^Xxo in the 
first equation, and estimating the error by the triangle inequality, we arrive at 

\\HoXo - YH-^Xxoh < WHqXo + Yx^ + \\Yx + YH^^Xxoh 
< 5 + ||y|| • \\x + H^^Xxoh 
<6 + L2-6Li = 6{l + LiL2). 

Note that the left hand side is ||(-ffo ~ YH~^X)xq\\2, and that ||xo||2 ^ since 
X G 5'i^2,3- By the definition of the smallest singular value, it follows that 

s„,in(ifo - YH-^X) <^6{l + L1L2). 
Rearranging the terms concludes the proof of the lemma. □ 

In order to apply Lemma 14.41 for the matrix A in ()4.19p , let us check that the 
boundedness assumptions are satisfied. We already know that ||^~^|| < Ki/e from 
(OOD . Further, 

||y|| = \\R^vb-eZ'^\\. 

Here, ||-Ro|| = !> Il'^^ll ^ ll^ll ^ 1) < ||-D|| < K by the assumption of the 

theorem, and \\Z\\ < \\S\\ < Kq^/u if the event £3 holds. Putting these together, we 
have 

||y|| <K + eKoV^ < 2K 
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where the last inequahty follows from (j4.2p . An application of Lemma 14.41 yields 
that, on the event £3 one has 

(4.21) inf \\Ax\\2>—4^s^UH^-YMX) 
where 

(4.22) M:=H-^, ||Af|| < 

We have reduced our problem to invertibility of the 3x3 matrix Hq — YMX. 

4.3.4. Dependence on the global perturbation R. Let us write our random matrix 
Hq — YMX as a function of the global perturbation Rq (which determines R). 
Recalling (|4.19|) . we have 

Hq - YMX = RqVqDo + I0 + eSo - {RqvD - £Z'^)M{uDo + eZ) = a + Rob, 
where 

a := /o + eSo + eZ'^MuDo + e^Z'^MZ, 

(4.23) b:=VoDo-vDM{uDo + eZ). 

It will be helpful to simplify a and b. We shall first remove the terms eSq and 
e'^Z'^MZ from a, and then (in the next subsection) remove all other terms from a 
and b that depend on Z. 

To achieve the first step, observe that on the event £37 we have 

\\eSo\\ < sKoV^; 
We^Z'^MZW < e^\\Zf\\M\\ < e^H^f ||M|| 

< • Kin ■ ^ (by definition of £3 and (IMjl ) 
= eKlKiu. 

Therefore we can approximate a by the following simpler quantity: 

(4.24) ao := Iq + eZ'^ MuDq, 

||a — oolb < eK^y/n + eK^Kin < 2eKlKin. 
Hence we can replace a by oq in our problem of estimating 

(4.25) Smin(^^o - YMX) = s^inia + Rob) > s^^ao + Rob) - 2eKlKin. 

We have reduced the problem to the invertibility of the 3x3 random matrix a + Rob. 

4.3.5. Removing the local perturbation Z . As we mentioned in the introduction, the 
argument in this case (when the minor is well invertible) relies on global pertur- 
bations only. This is the time when we remove the local perturbation Z from our 
problem. To this end, we express ao + Rob as a function of Z using (|4.24p and (j4.23p : 

ao + Rob = L + Z'^eMubo - RqvDMeZ, 

where 

(4.26) L:=Io + Ro{Vo - vDMu)Do. 

If we condition on everything but Z, we can view ao+Rob as a Gaussian perturbation 
of the fixed matrix L. It will then be easy to show that oq + Rob is well invertible 
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whenever L is. This will reduce the problem to the invertibility of L; the local 
perturbation Z will thus be removed from the problem. 

Formally, let Ai G (0, 1) be a parameter to be chosen later; we define the event 

■= {Smm{L) > Ai} . 

Note that is determined by Rq, Qq and is independent of Z. 
Let us condition on Ro,Qo satisfying £i. Then 

(4.27) Smin(ao + R«b) > Smin(L) • Smin (^"^flO + Rob)) > Xl ■ Smm(/o + f{Z)) 

where 

(4.28) f{Z) := L'^Z'^eMuDo - L-^RqvDMeZ 

is a linear function of (the entries of) Z. A good invertibility of Jq + f{Z) is guar- 
anteed by the following lemma. 

Lemma 4.5 (Invertibility of Gaussian perturbations). Let m > 1, and let f : M"* — t- 

C^^"^ be a linear (matrix-valued) transformation. Assume that \\f\\ < K for some 
K>1, i.e. ||/(^)||hs < K\\z\\2 for all z G . Let Z ~ N^{^,I^). Then 

^{SraUl+f{Z))<t}<CKt^l\ t>0. 

We defer the proof of this lemma to Appendix IB. 2[ 

We will use Lemma [4.51 with m = 3(n — 3), rewriting the entries of the (n — 3) x 3 
matrix Z as coordinates of a vector in M"*. In order to apply this lemma, let us bound 
||/(Z)||hs in (I4.28|) . To this end, note that < A|f^ if the event £l occurs; 

||M|| < Ki/e by K22\f : \\u\\ < \\U\\ = 1; ||i;|| < ||y|| < 1; ||5o|| = \\Do\\ < \\D\\ < K; 
\\D\\ < \\D\\ < K- \\Rq\\ = 1. It follows that 

||/(Z)||HS<2Ar^i^i^i||Z||HS. 
An application of Lemma 14.51 then yields 

IPz {Smin(/0 + f{Z)) <t}< CXY^KKi t > 0. 

Putting this together with (j4.27p . we have shown the following. Conditionally on 
Rq, Qq satisfying £l, the matrix oq + Rob is well invertible: 

(4.29) ¥z {smin(ao + ^o^) < Ait} < CX^^KKi t^^\ t > 0. 

This reduces the problem to showing that event £l is likely, namely that the random 
matrix L in (|4.26p is well invertible. The local perturbation Z has been removed 
from the problem. 

4.3.6. Invertibility in dimension 3. In showing that L is well invertible, the global 
perturbation Rq will be crucial. Recall that 

L = Iq + RqB, where B = {Vo-vDMu)Do. 

Then Sinin(-^) = Smin(^ + -^0 ^)- If we condition on everything but Rq, we arrive at 
the invertibility problem for the perturbation of the fixed matrix B hy a random 
matrix Rq uniformly distributed in 0(3). This is the same kind of problem that our 
main theorems are about, however for 3x3 matrices. But recall that in dimension 
3 the main result has already been established in Theorem 14.11 It guarantees that 
B + Rq^ is well invertible whenever B is not approximately complex orthogonal. 
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i.e. whenever H-B-B"*" — /|| ^ This argument reduces our problem to breaking 

complex orthogonality for B. 

Formally, let A2 G (0, 1) be a parameter to be chosen later; we define the event 

Sb ■■= [\\BB^ -I\\ > Aapf }. 

Note that £b is determined by Qq and is independent of Rq. 

Let us condition on Qq satisfying £b- Theorem 14.11 then implies that 

(4.30) FnoiSD = Pi?o {Smin(^) < Xl} = ^r, {sn,in{B + Rq^) < Ai} < C(Ai/A2)^ 

This reduces the problem to showing that £b is likely, i.e. that B is not approxi- 
mately complex orthogonal. 

4.3.7. Breaking complex orthogonality. Recall that 

B = {Vo- vDMu)Do =: TDq, Dq := QoDoQJ , 

where Qq is a random matrix uniformly distributed in 50(3). Thus Dq is a ran- 
domized version of -Dq obtained by a random change of basis. 

Let us condition on everything but Qq, leaving B fixed. The following general 
result states that if Dq is not near a multiple of identity, then T is not approximately 
complex orthogonal with high probability. 

Lemma 4.6 (Breaking complex orthogonality). Let n G {2, 3}. Let D = diag((ij) G 

max \di\ < K, Idif — di^] > S 

i 

for some K,5 > 0. Let T G C"^". Let Q he uniformly distributed in SO(n) and 
consider the random matrix B := AQDQ^ . Then 

(4.31) pjllSS"^ -/|| < I < C(^KV'5)^ t>0. 
We defer the proof of this lemma to Appendix IB. 3[ 

Let us apply Lemma [461 for Dq = diag((ii, ^2, d^). Recall that the assumptions of 
the lemma are satisfied by (|1.2p and (|4.9p . Then an application of the lemma with 
* = -^2 yields that 

(4.32) Fq,{S%) = Pq„ [wbB-" - I\\ < X2\\Bf} < C(A2KV<5)^ 

This was the remaining piece to be estimated, and now we can collect all pieces 
together. 

4.3.8. Putting all pieces together. By (j4.30p and (j4.32p . we have 

Fqo,Ro{£l) = ^Qo ^Boi^tlQo) < Pij(,(£'£|Qo) l{0o satisfies £b} + ^Qoi^s) 

< c{\i/X2y + c{X2Ky6y. 

By a similar conditional argument, this estimate and (14.29P yield 
Fqo,Ro,z {smin(ao + ^0&) < Alt A £3} < q, 

where 

(4.33) q := CX^^KKi t^'^ + C(Ai/A2)'= + CiXiK^/df. 
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Obviously, we can choose C > 1 and c < 1. By (j4.25p . 

Pz,Qo,% {smin(i^o " YMX) < X^t - 2eK^Kin A£s}<q 
and further by ()4.2ip . we obtain 

Thus we have successfully estimated pi^s in (|4.16|) and in (|4.1ip : 
4.34 pi,3<<z iorfi= ° ^ 

and where q is defined in (j4.33p . By an identical argument, the same estimate holds 
for all pi^i in the sum (j4.10p : 

(4.35) pi^i < q, i = 3, . . . ,n. 

Summarizing, we achieved the goal of this section, which was to show that A 
is well invertible on the set Si^2 in the case when there is a well invertible minor 

^{l,2,i)- 

Remark 4.7 (Doing the same for (n — 2) x (n — 2) minors). One can carry on the 
argument of this section in a similar way for (n — 2) x (n — 2) minors, and thus 
obtain the same estimate for the probability 

p| inf Px||2<2/xeA||(A(i,2))~'||< — A^sj- 
as we obtained in (I4.34p for the probability pi^3 in (j4.1ip . 

4.4. When all minors are poorly invertible: going 1 + 2 dimensions up. In 

this section we estimate the probability pi^o in the decomposition ()4.10p . i.e. 

(4.36) pi,o = p| inf Px||2 <2^eA||(%,2,^))"'|| > — Vie [3:n]Af5j. 

4.4.1. Invertibility of a matrix with a poorly invertible minor. The following analog 
of Lemma 14.41 for a poorly invertible minor will be helpful in estimating pi,o- Un- 
fortunately, it only works for (n — 1) x (n — 1) minors rather than (n — 3) x (n — 3) 
or (n — 2) X (n — 2) minors. 

Lemma 4.8 (Invertibility of a matrix with a poorly invertible minor). Consider an 
n X n matrix 



H 



Ho Y 
X H 



where Hq £ C, H £ 



(n-l)x(n-l) 



Assume that X ~ N^{i',e'^In-i) for some fixed v G ^ and e > 00 We assume 
also that H is a fixed matrix satisfying 



Although X is complex-valued, X — v is real valued variable distributed according to 
N{0,e^ln-i). 
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for some L > 0, while Hq and Y may be arbitrary, possibly random and correlated 
with X. Then 

inf \\Hx\\2 < ^ - T I < Ct./^, t > 0. 
xeSi Jn L I 



Proof. Choose x ^ Si which attains inf^.^^^ ||i^a;||2 =: 5 and decompose it as 

where xq G C, x G C"~\ 



X 



Xo 
X 



As in the proof of Lemma 14.41 we deduce that 

\\H^^Xxo + x\\2 < ^lli^-^ll. 



\x\\2 > WH'^Xxoh- 5\\H-'^\\. 



|^"^Xxo||2 = Ixol \\H~^X\\ > \\H-^X\ 

In 



This yields 
Note that 



where the last inequality is due to x G 5*1 . 

Further, we have ll-ff"^^!! ~ ||i?~^||HS by standard concentration techniques. We 
state and prove such result in Lemma IB. 21 in Appendix B. It yields that 

P {||i7^^X||2 < te||i?^^||Hs} < Ct^, t > 0. 

Next, when this unlikely event does not occur, i.e. when ||ii'~-'^Ar||2 > te||-ff~^||HS! 
we have 

||x||2 > ^Ili^-^HS - S\\H-'\\ >(1L.s]\\H-'\\>(^-6]l. 



/n / 
On the other hand, ||x||2 < ||x||2 = 1. Substituting and rearranging the terms yields 

Vn L 

This completes the proof of Lemma 14. 8[ □ 

4.4.2. Going one dimension up. As we outlined in Section r2.2.3l the probability pi.o 
in (j4.36p will be estimated in two steps. At the first step, which we carry on in this 
section, we explore the condition that all (n — 3) x (n — 3) minors of Ai^2 are poorly 
invertible: 

ll(V2.))"'ll > V ^^^[3^^]- 

Using Lemma [4.81 in dimension n — 2, we will conclude that the matrix ^(1,2) is well 
invertible on the set of vectors with a large i-th coordinate. Since this happens for 
all i, the matrix ^(1,2) is well invertible on all vectors, i.e. ||^(i^2)ll ia,rge. 
This step will thus carry us one dimension up, from poor invertibility of all minors 
in dimension n — 3 to a good invertibility of the minor in dimension n — 2. 

Since we will be working in dimensions [3 : n] during this step, we introduce the 
appropriate notation analogous to ()4.7p restricted to these dimensions. Thus S"'^'"' 
will denote the unit Euclidean sphere in C'^'"', so 

(4.37) 513-1 = IJ where :={xGS[3-] : |xi| >n-i/2}. 

i6[3:n] 
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We apply Lemma for 
Recall from Section [4.2.11 that 



(1,2,3), 



L 



e ' 



t 



A 



* 
* 

X 



Y 
H 



VD + I + eS, 



where S" is a skew-symmetric Gaussian random matrix (with i.i.d. A^]r(0, 1) above- 
diagonal entries). Let us condition on everything except the entries Sij with i £ [4 : 
n], j = 3 and with i = 3, j £ [4 : n], since these entries define the parts X, Y of H. 
Note that X = + eS^'^^ , where the vector i/ G C"~'^ is independent of S, and S^^^ 
is a standard real Gaussian vector with coordinates «S4^3, . . . , Sn,3- 

Lemma 14.81 used with t = then implies that if || (^(i,2,3))~^ II > Ki/e then 



Therefore, unconditionally, 



inf \\A 



[3:, 



(1,2)3^112 



< 



< 



Cn 



inf \\A 



x&S. 



(1,2)2;||2 



< 



A||(A( 



1,2,3). 



-'II >^ 



< 



Cn 



By an identical argument, the dimension 3 here can be replaced by any other di- 
mension i £ [3 : n]. Using a union bound over these i and (j4.37p . we conclude 
that 

e 

This is of course the same as 



inf P(i,2)x||2 < ^ A ||(^(i,2,i))-i > fli Vi G [3 : 



n 



< 



(4.38) 



{ll(Ai,2))''ll > A ||(%,2,i))^^|| > Vi G [3 : n]| 



< 



Crl_ 



This concludes the first step: we have shown that in the situation of pi^o when all 
minors ^(1^2, «) ^'i'^ poorly invertible, the minor ^{1,2) is well invertible. 

4.4.3. Going two more dimensions up. At the second step, we move from the good 
invertibility of the minor ^(1,2) that we have just established to a good invertibility of 
the full matrix A. But we have already addressed exactly this problem in Section[ 
except for the minor ^(1,2,3)- So no new argument will be needed in this case. 
Formally, combining (j4.36p . (j4.38p . and the estimate (|4.3p on Eg, we obtain 

inf ||Ax||2<2^*eA||(A(i,2)r"' ' ' ^ ' ^ 



< 



<—/\£s 

£ 



+ —— + 2eiLY>{-cK^n). 

xe5i,2 " e ~ ] Ki 

The probability here is very similar to the probability pi'i in (j4.1ip and is bounded 
in the same way as in ()4.34p . see Remark 14.71 We conclude that 

Cn' 



(4.39) 



Pi,o <q + -TT- + 2exp(-cA'o"-) 
A 
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where fj, and q are defined in (j4.34p and (|4.33p respectively. 

We have successfully estimated pi^ in the sum (j4.10p . This achieves the goal of 
this section, which was to show that A is well invertible on the set 5*1^2 in the case 
when there all minors ^(i,2,i) poorly invertible. 



4.5. Combining the results for well and poorly invertible minors. At this 
final stage of the proof, we combine the conclusions of Sections 14.31 and 14.41 
Recall from (jTTO]l that 

n 

Pi < ^Pl,i+Pl,0- 

i=3 

The terms in this sum were estimated in (j4.35p and in (j4.39p . Combining these, we 
obtain 

Pi < nq+ — h 2exp{-cKQn). 

K 



An identical argument produces the same estimate for all pi, i = 2, ... ,n in (j4.8p . 
Thus 

n 

P = P {smin{D + U)< fie} <Y,P^ + 2exp(-cif2^) 

i=l 

Cr? 

(4.40) < + -— + 2(n + l)exp(-cK^n). 

-f^i 

Recall that /i and q are defined in (j4.34p and (j4.33p respectively, and C > 1, c < 1 
in these inequalities. 

Finally, for t G (0, 1), we choose the parameters A'o > 1, i^i > 1, e, Ai, A2 G (0, 1) 
to make the expression in (j4.40p reasonably small. For example, one can choose 

i^o = log(l/t), = 

^9/8 



e 



24Xlog2(l/t)n3/2' 
With this choice, we have 

+9/8 

p < Cinh'/^^K^Sy, 

and so ()4.2p and (j4.5p are satisfied, and (|4.4p is satisfied whenever t < 6"*^°. Sum- 
marizing, we have shown that 

This quickly leads to the conclusion of Theorem 11.31 □ 
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5. Application to the Single Ring Theorem: proof of Corollary 11.41 

In this section we prove Corollary 11.41 which states that condition (SR3) can be 
completely eliminated from the Single Ring Theorem. Let Dn be a sequence of 
deterministic n x n diagonal matrices. (The case of random Dn can be reduced to 
this by conditioning on Dn-) If z 7^ 0, then 

(5.1) S^UUnDnVn " z/n) = \z\ ■ Smini{l/z)Dn " Un^-'), 

where the matrix Un^V~^ is uniformly distributed in U{n) or 0{n). Let us first 
consider the case where the matrices D„ are well invertible, thus we assume that 

r := inf Smin(-C'„) > 0. 
In the complex case, an application of Theorem 11.11 yields the inequality 

(5.2) P {Sn,ln{UnDnVn " z/n) < tr} < t^rf , < t < 1/2, 

which holds (uniformly) for all z G C, and which implies condition (SR3). Indeed, 
Theorem 11.11 combined with (|5.1|) imply the inequality (|5.2p for \z\ > r/2. In the 
disc \z\ < r/2 we use the trivial estimate 

^min{UnDnVn ~ zin) ^ '5min(fAi^n^) ~ |^| ^ 

which again implies (|5.2p . 

Now consider the real case, still under the assumption that r > 0. Condition 
(SRI) allows us to assume that ||-Dn|| ^ K for some K and for all n. Condition 
(SR2) and |10| Lemma 15] imply that |sfc(L'.„) — 1| > 1/(4ki) for some 1 < k < n. 
Hence 

inf \\Dn-V\\>—. 
veO{n) 4ki 

An application of Theorem 11.31 together with (j5.ip shows that inequality ()5.2p holds, 
which in turn implies condition (SR3). In this argument, we considered the matrix 
{l/z)Dn, which has complex entries. This was the reason to prove more general 
Theorem 11.31 instead of the simpler Theorem 11.21 

It remains to analyze the case where the matrices Dn are poorly invertible, i.e. 
when inf^gpj Smin(-Dn) = 0. In this case the condition (SR3) can be removed from 
the Single Ring Theorem using our results via the following argument, which was 
communicated to the authors by Ofer Zeitouni [23]. The proof of the Single Ring 
Theorem in [10] uses condition (SR3) only once, specifically in the proof of \10\ 
Proposition 14] which is one of the main steps in the argument. Let us quote this 
proposition. 

Proposition 14 (jlOj). Let v^^^ be the symmetrize^ empirical measure of the sin- 
gular values ofUnDnVn — zIn- Assumc that the conditions (SRI), (SR2), and (SR3) 
of the Single Ring Theorem hold. 

(i) There exists a sequence of events 0„ with P(r2n) — ^ 1 such that for Lebesgue 
almost every z £ C, one has 

(5.3) lim lim sup E [ In^ log |x| di/^") (x) = 0. 



Symmetrization here means that we consider the set of the singular values Sk together with 
their opposites — Sfe. 
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Consequently, for almost every z G C one has 
(5.4) / log |x| d^'^")(x) / log\x\du:^{x) 



for some limit measure in probability, 
(ii) For any R > and for any smooth deterministic function (p compactly sup- 
ported in Bji = {z £ C : \z\ < R}, one has 

(5.5) / ip{z) / log |x| (iz/|")(x) dm(z) — )• / (p{z) / log \x\ dvzix) dm{z). 



Our task is to remove condition (SR3) from this proposition. Since the argu- 
ment below is the same for unitary and orthogonal matrices, we will not distinguish 
between the real and the complex case. 

Even without assuming (SR3), part (i) can be deduced from Theorems 11.11 and 
11.21 bv the argument of [10], since condition (j5.3p pertains to a fixed z. 

It remains to prove (ii) without condition (SR3). To this end, consider the prob- 
ability measure fl with the density 

(5.6) ^(,) = ^a( [ log\x\dMx)). 

dm 2tt ViM / 

This measure was introduced and studied in [10] . After the Single Ring Theorem is 
proved it turns out that /2 = /Xg, where /^e is the limit of the empirical measures of 
eigenvalues. However, at this point of the proof this identity is not established, so 
we have to distinguish between these two measures. 

It was shown in [10] that for any smooth compactly supported function / : C — t- C 
such that condition (SR3) holds with some 5, 5' > for almost all z G supp(/), one 
has 

(5.7) / f{z)d^,t\z)^ [ f{z)dfL{z)- 

Jc Jc 

The argument in the beginning of this section shows that if Q := supp(/) C Bji\Br 
for some r > 0, then (|5.2p holds uniformly on Q, and therefore (15. 7p holds for such 

The proof of [10^ Theorem 1] shows that it is enough to establish (ii) for all smooth 
compactly supported functions if that can be represented as ip = Aip, where ^ is 
another smooth compactly supported function. Assume that (ii) fails, thus there 
exist e > 0, a subsequence {nk}^i, and a function : C — >■ C as above, such that 



Aip{z) / log\x\ di'["'''\x) dm{z) — / Aip{z) / log\x\ di'z{x) dm{z) 



> £. 



(5.8) 

Recall the following identity [101 formula (5)]: 

(5.9) I i^{z) d^J.'i'\z) = — [ A^{z) [ log\z\d4''\x)dm{z). 

Condition (SRI) implies that the sequence of measures /ui"''"^ is tight, so we can 

extract a further subsequence {fie ' which converges weakly to a probability 
measure fi. 
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We claim that /i = /i. Indeed, let / : C — )• [0, 1] be a smooth fmiction supported 
in Bfi \ Br for some r > 0. Then the weak convergence implies 

/ fiz)df,{z). 
Jc 



Since / satisfies (|5.7p . we obtain 



f{z)df,{z)= / f{z)dfi{z). 

C JC 

This means that the measure /U coincides with /i on C \ {0}. Since both /i and fx are 
probability measures, fi = jl. 

Since fl is absolutely continuous, we can choose r > so that jl{Br) < g^n^y — . 
Let : C — )• [0, 1] be a smooth function such that supp(r7) C Br and r]{z) = 1 for 
any z G Br/2- Then 



(5.10) 

and therefore 
(5.11) 



r]{z)djl{z) < 



Svr 



r]{z) d^^^'''^\z) < 



Svr 



for all sufficiently large I. Let us estimate the quantity in (|5.8p : 



A'(/'(z) / log I x| (ii^x '''\x) dm{z) — I Aip{z) I log \x\ duzix) dm{z) 



< 



ink,), 



A((l — 77)^) (z) / log\x\ duz ' (x) dm{z] 
c Jr 

— J A((l — r/)^) (z) I log \x\ di'zix) dm{z] 

rjif){z) I log\x\dvi '''■\x)dm{z) 
Jr 



Ai 



+ 



A(r]'ip^{z) / log\x\ duzix) dm{z) 



Consider the terms in the right hand side separately. By (j5.6p and (j5.10p . we have 



A(r/'(/') (z) / log \x\ duz{x) dm{z] 



= 2tt 
< 2tt 



[rji}) {z)dji{: 

r]{z)dfL{z] 



e 

<4- 



Similarly, ()5.9p and (15. lip imply that for large I 



riif){z) I log\x\dvi '''■\x) dm{z 



e 

<4- 
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The function Cp = A((l — r/)-)/;) is supported in the annulus Bji \ This 
function satisfies (|5.7p . so using (j5.6p and (j5.9p . we obtain 



/ (^(z) / log\x\ du^P^x) dm{z) 
Jc Jr 



Lp{z) / log\x\ duz{x) dm{z) 



2vr/ ((l-7?)^)(z)d/.h\z)-27r / ((1 - (z) 



The combination of these inequahties yields 

limsup /a^.) /■l„g|xM.f*.'(x)<i™(.)- /a*(.) /log|x|<i..(x)*„(.) 

/-s>oo Jc Jr Jc Jr 

e 

<2' 

which contradicts (15.81). □ 



(n) 

Remark 5.1. Convergence of the empirical measures of eigenvalues /ie to the limit 
measure /Xg does not imply the convergence of the eigenvalues to an annulus. Indeed, 
there may be outliers which do not affect the limit measure. For example, assume 
that {Dn}^=i is a sequence of diagonal matrices 

Dn = diag((ii, . . . ,(i„_i,0), 

where di, . . . , d„_i are independent random variables uniformly distributed in [1, 2]. 
Then the Single Ring Theorem asserts that the support of the measure /Xg is the 
annulus \/2 < |z| < y^7/3, see (11. 4p . At the same time, all matrices — U^iDi^^n 
have eigenvalue 0. 

Guionnet and Zeitouni [11] established sufficient conditions for the convergence 
of the spectrum of An to an annulus. Assume that the matrices satisfy (SRI), 
(SR2), and (SR3), and in addition: 

(SR4) Assume that 

-1/2 / foo \ -1/2 

-2 ■ 



(i/u(")(x)J ^a=U x-^dfis{x) 

\ 1/2 / rco 



and inf„ Sinin(-C'„) > whenever a > 0. 

Then |11|, Theorem 2] claims that the spectrum of An converges to the annulus 
a < \z\ < 6 in probability. 

Arguing as before, one can eliminate the condition (SR3) from this list. The other 
conditions are formulated in terms of the matrices Dn only. 



Appendix A. Orthogonal perturbations in low dimensions 

In this section we prove Theorem 14. H which is a slightly stronger version of the 
main Theorem 11.31 in dimensions n = 2 and n = 3. The argument will be based on 
Remez-type inequalities. 
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A.l. Remez-type ineqalities. Remez inequality and its variants capture the fol- 
lowing phenomenon: if a polynomial of a fixed degree is small on a set of given 
measure, then it remains to be small on a larger set (usually an interval). We refer 
to m [8] to an extensive discussion of these inequalities. 

We will use two versions of Remez-type inequalities, for multivariate polynomials 
on a convex body and on the sphere. The first result is due to Ganzburg and Brudnyi 
[HIS], see [3 Section 4.1]. 

Theorem A.l (Remez-type inequality on a convex body). Let V C he a convex 
body, let E (IV be a measurable set, and let f be a real polynomial on of degree 
n. Then 

sup|/(x)|< (^^)"sup|/(x)|. 

x&V \ \^\ J x&E 

Here \E\ and \V\ denote the m-dimensional Lebesgue measures of these sets. □ 
The second result can be found in [3, see (3.3) and Theorem 4.2 there. 

Theorem A. 2 (Remez-type inequality on the sphere). Let m € {1, 2}, let E CI 

be a measurable set, and let f be a real polynomial on M™+^ of degree n. Then 

sup |/(x)|< (^)'"sup|/(x)|. 

Here \E\ denote the m-dimensional Lebesgue measure of E. □ 

Remark A. 3. By a simple argument based on Fubini theorem, a similar Remez-type 
inequality can be proved for the real three-dimensional torus T^ := x S'^ C 
equipped with the product measure: 

(A.l) sup|/(x)|< (^y"sup|/(x)|. 

A. 2. Vanishing determinant. Before we can prove Theorem 14.11 we we establish 
a simpler result, which is deterministic and which concerns determinant instead of 
the smallest singular value. The determinant is simpler to handle because it can be 
easily expressed in terms of the matrix entries. 

Lemma A. 4 (Vanishing determinant). Let B be a fixed nxn complex matrix, where 
n G {2,3}. Assume that \\B\\ > 1/2. Let e > and assume that 

\det{B + U)\<e for all U e SO{n). 

Then 

\\BB'^ - L\\ < Ce\\B\\. 

Proof. To make this proof more readable, we will write a < 6 if a < Cb for a suitable 
absolute constant C, and a ~£ 6 if |a — 6| < e. 
Dimension n = 2. Let us represent 



U = C/(0) 



cos (p sm ( 
— sin 6 cos ( 



Then det(i? + C/) is a trigonometric polynomial 

det(i? + U) = ko + ki cos 4> + k2 sin ( 
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whose coefficients can be expressed in terms the coefficients of B: 

ko = det{B) + 1; h = Bu + B22; k2 = Bu - B21. 

By assumption, the modus of this trigonometric polynomial is bounded by e. There- 
fore all of its coefficients are also bounded, i.e. 

\ki\<e, i = 1,2,3. 

It is enough to check that all entries of BB^ are close to the corresponding entries 
of /. We will check this for entries (1, 1) and (1,2); others are similar. Then 

{BB^)ii = Bfi + Bf2 ~e' -B11B22 + B12B21 

where we used that \ki\ < e, |/c2| ^ £, and thus the resulting error can be estimated 
as 

e' <e{\Bu\ + \B,2\)<e\\B\\. 

But 

-BnB22 + ^12^21 = - det{B) 1, 
where we used that |A;o| ^ £■ We have shown that 

\iBB'^)u-l\<e\\B\\+e<e\\B\\, 

as required. 

Similarly we can estimate 



(BB^) 



12 



B11B21 + B12B22 ~e' B11B12 — B12B11 = 0. 



Repeating this procedure for all entries, we have shown that 

\{BB'')ij-Iij\<e\\B\\ 
for all This immediately implies the conclusion of the lemma in dimension 



n = 2. 

Dimension n 



3. We claim that 

-1; B,,,^,(-iy+^+^det{B'^), {1,2,3}, 



(A.2) det{B)^e-^, ^^3-e 

where B"^^ denotes the minor obtained by removing the i-th row and j-th column 
from B. 

Let us prove \K.2\ for i = j = 1; for other entries the argument is similar. Let 

"10 

U = U{(j)) = COS0 sin( 
— sin (f) cos ( 

Then as before, det{B + [/) is a trigonometric polynomial 

det(i? + U) = ko + ki cos + k2 sin cp 

whose coefficients can be expressed in terms the coefficients of B. Our argument 
will only be based on the free coefficient ko, which one can quickly show to equal 

Bii + 1 B12 -B13 
ko = det B21 B22 B23 

B31 Bs2 -B33_ 

As before, the assumption yields that |A;o| < e, so 
(A.3) -""^'"^ ' -^-^Dii 



+ Bii + 1 = det(S) + det(S^^) + Bu + L 



det{B) + det(5^^) + Bn + 1 0. 
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Repeating the same argument for 

"-1 

U = U {(j)) = COS (j) sin (p 

sin (p — cos ( 

yields 

(A.4) det{B) - det{B^^) - Bii + I 0- 

Estimates ()A.3P and ()A.4p together imply that 

det(B)«, -1; -det(B^i). 

This implies claim ()A.2p for i = j = 1; for other entries the argument is similar. 
Now we can estimate the entries of B^B. Indeed, by ()A.2p we have 

(yBB'^)ii = Bfi + BI2 + Bf^ 

(A.5) -Bii det(B") + B12 det{B^^) - B13 det{B^^), 

where the error e' can be estimated as 

e' <e{\Bu\ + \Bu\ + \Bn\)<e\\B\\. 

Further, the expression in (jA.SP equals — det(i?), which can be seen by expanding 
the determinant along the first row. Finally, — det(i?) 1 by ()A.2p . We have 
shown that 

\{B''B)u-l\<e\\B\\+e<e\\B\\, 

as required. 

Similarly we can estimate 

{BB'^)i2 = B11B21 + B12B22 + B13B23 

Bu det{B^^) - B12 det(s22) ^ det(s23) 

= -Bii(i?i2i?33 - -632-813) — Bi2{BiiB33 — B31B13) + i?l3(i?lli?32 - B31B12) 

= 

(all terms cancel). 

Repeating this procedure for all entries, we have shown that 

\{BB'^)ij-Iij\<e\\B\\ 

for all This immediately implies the conclusion of the lemma in dimension 
n = 3. □ 

A. 3. Proof of Theorem 14. 11 Let us fix t; without loss of generality, we can assume 
that t < (5/100. Let us assume that B + U is poorly invertible with significant 
probability: 

(A.6) F{SrainiB + U)<t}>p{6,t). 

where p{5, t) G (0, 1) is to be chosen later. Without loss of generality we may assume 
that U is distributed uniformly in SO{n) rather than 0{n). Indeed, since 0{n) can 
be decomposed into two conjugacy classes SO{n) and 0{n) \ SO{n), the inequality 
(fATHl) must hold over at least one of these classes. Multiplying one of the rows of 
B + U hy —1 if necessary, one can assume that it holds for SO{n). 

Note that ||-B|| > 1/2; otherwise Smin(-B + U) > I - \\B\\ > 1/2 > t for all 
U eO{n), which violates (lA.6p . 
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A. 3.1. Dimension n = 2. In this case the result follows easily from Lemma [A. 41 and 
Remez inequality. Indeed, the event Smin(-B + U) < t implies 

I det{B + U)\ = SminiB + U)\\B + U\\ < t{\\B\\ + 1) < 3t\\B\\. 

Therefore, by (IA.6P we have 

(A.7) F{det{B + U) < 3t\\B\\} > p{6,t). 

A random uniform rotation U = [-yx] £ SO{2) is determined by a random uni- 
form point {x,y) on the real sphere S^. Now, det(L' + [/) is a complex-valued 
quadratic polynomial in variables x, y that is restricted to the real sphere S^. Hence 
I det(L' + C/)p is a real- valued polynomial of degree 4 restricted to the real sphere 
S^. Therefore, we can apply the Remez-type inequality, Theorem lA.2| for the subset 
E ■.= {U det(L» + C/)|V||B||2 < 3t} of which satisfies \E\ > 2-Kp{5,t) according 
to (jXTj) . It follows that 

\det{B + U)\< (—^) °t\\B\\ for ah [/ G 50(2). 



An application of Lemma IA.4I then gives 



\BB^-I\\<C,(^]''"t\\B\\'. 



.p{6,t). 

On the other hand, assumption (|4.ip states that the left hand side is bounded below 
by (5||S|p. It follows that 

Now we can choose p{S,t) = C{t/6Y with sufficiently large absolute constant C 
and sufficiently small absolute constant c > so that inequality (jA.SP is violated. 
Therefore ()A.6p fails with this choice of p(5, t), and consequently we have 

^{smUB + U)<t}<C{t/5Y, 

as claimed. 

A. 3. 2. Dimension n = 3: middle singular value. This time, determinant is the 
product of three singular values. So repeating the previous argument would produce 
an extra factor of ||-B||, which would force us to require that 

\\BB'^ - I\\ > 6\\Bf. 

instead of (fO) . 

The weak point of this argument is that it ignores the middle singular value of B, 
replacing it by the largest one. We will now be more careful. Let si > 52 > S3 > 
denote the singular values of B. 

Assume the event Smin(-B + U) < t holds. Since \\U\\ = 1, the triangle inequality, 
Weyl's inequality and the assumption imply that the three singular values oi B + U 
are bounded one by + 1 < ||i?|| + 1 < 3||i?||, another by S2 + 1 and the remaining 
one by t. Thus 

\det{B + U)\ <3t{s2 + l)\\B\\. 

Let K > 2 he a parameter to be chosen later. Suppose first that S2 < K holds. 
Then | det{B + U)\ < 6tK\\B\\, and we shall apply Remez inequality. In order to 
do this, we can realize U G 50(3) as a random uniform rotation of the {x,y) plane 
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followed by an independent rotation that maps the z axis to a uniform random 
direction. Thus \J is determined by a random point (x, y, zi, ^2, 2^3) in the real 
three-dimensional torus T3 = x 5^, chosen according to the uniform (product) 
distribution. Here (x^y) G 5^ and (zi, 2:25-23) G S"^ determine the two rotations we 
described aboveJl 

We regard | det(i? + C/)p as a real polynomial in five variables x, y, zi, 22, -23 and 
constant degree, which is restricted to T3. Thus we can apply the Remez-type 
inequality for the torus, (|A.ip . and an argument similar to the case n = 2 yields 

(A.9) P {Sniin(5 ^U)<t}< C{tK/5f. 

Now we assume that S2 > K. We will show that, for an appropriately chosen K, 
this case is impossible, i.e. B + U can not be poorly invertible with considerable 
probability. 

A. 3. 3. Reducing to one dimension. Since si > S2 > K > 2, \t must be that S3 < 2; 
otherwise all singular values of B are bounded below by 2, which clearly implies 
that Srain{B + U) >1 for all U G 0(3)- This will allow us to reduce our problem to 
one dimension. To this end, we consider the singular value decomposition of B, 

B = siqipl + 5292^2 + S393P3' 

where {pi,P2,P3} and {(71, <?2> Q3} are orthonormal bases in C^. 

Assume the event Sniin(-B + U) < t holds. Then there exists x G C^, ||x||2 = 1, 
such that 

\\{B + U)x\\2<t. 

We are going to show that x is close to ^3, up to a unit scalar factor. To see this, 
note that ||-Bx||2 < 1 + t < 2, so 

(A.IO) 4 > \\Bx\\l = sl\p\x\^ + sl\p*2x\^ + sl\p%x\^ > K^i\plx\^ + \p*2x\^) 
= K\1-\pIx\'). 

It follows that 

(A.ll) ^'4^- l^3^l - ^ 

(the right hand side holds since IIP3II2 = ||a^||2 = !•) Let r] := p'^x/\p'^x\; then 
\\x - Wslli = \\x/r]-p3\\l = \p*i{x/ri -ps)]"^ + \pl{x/r] - ps)\^ + IpUx /r] - P3)\'^ 

= |pix|^ + |p2^P + \\P3x\ — l|^ (by orthogonality and definition of 77) 

4 16 , , , , 

-K^^W (by dAini) and dAlI])) 

8 

Now, by triangle inequality, 
(A.12) \qUB + U)p3\ = Iq^B + U)w3\ < WUB + U)x\ + \qUB + U){x - T]ps)\. 



^This construction and its higher-dimensional generalization follow the 1897 description of the 
Haar measure on SO(n) by Hurwitz, see 



41 

The first term is bounded by ||g3||2||(-B + ?7)a:||2 < t. The second term is bounded 
by 

M{B + u)h\\x- vpsh < iUBh + 1)^ = (.3 + < ^ < 

Therefore the expression in ()A.12p is bounded by t + 9/K. 

Summarizing, we have found vectors u,v G C"^, ||ti||2 = \\v\\2 = such that the 
event Smm(-B + U) < t imphes 

\u^{B + U)v\ <t + 9/K. 

Note that the vectors u = (93)"'', v = are fixed; they depend on B only. By ()A.6p . 
we have shown that 



P <^ \u 



^{B + U)v\ < t + 9/k} > p{S, t). 



We can apply Remez inequality for \vJ{B + U)v\'^, which is a quadratic polynomial 
in the entries of U. It yields 

(A.13) \u^(B + U)v\ < (-^Y\t + 9/K) for ah U G S0(3). 

\p{d, t)J 

Let Co G (0, 1) be a small absolute constant. Now we can choose 
(A.14) p{6, t) = C{t/6y, K = 4{6/t)^/^ 

with sufficiently large absolute constant C and sufficiently small absolute constant 
c > so that the right hand side in (|A.13p is bounded by cq. Summarizing, we have 
shown that 

(A.15) \u^{B + U)v\ < Co for ah U G S0{3). 

We are going to show that this is impossible. In the remainder of the proof, we shall 
write a ^ 1 to mean that a can be made arbitrarily small by a suitable choice of 
Co, i.e. that a < /(co) for some fixed real valued positive function (which does not 
depend on anything) and such that f{x) — >■ as a; — )• 0+. 



A. 3. 4. Testing on various U. Let us test (|AT5]l on U = U{4>) = 

Writing the bilinear form as a function of (p, we obtain 

u^{B + U)v = k + {uivi + U2V2) cos (j) + {uiV2 — U2V1) sin(^ 

where k = k{B, n, v) does not depend on <j). Since this trigonometric polynomial is 
small for all (j), its coefficients must are also be small, thus 

\uiVl + U2V2\ ^ 1, \U1V2 - U2Vl\ ^ 1. 

We can write this in terms of a matrix- vector product as 

Ul U2 
[-U2 Ul 

Since co is small, it follows that either the matrix [ ^^J^ ul ] is poorly invertible (its 
smallest singular value is small), or the vector [yl] has small norm. Since ||u||2 = 1, 
the norm of the matrix is bounded by \/2. Hence poor invertibility of the matrix is 



cos(/) s'm(j> 
— sin ip cos (f) 
1 









V2 



< 1. 
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equivalent to smallness of its determinant, which is -\- Formahy, we conclude 
that 



(A.16) either \vi\'^ + |t>2|^ < 1 or {uj + ul\ < 1. 



Assume that \vi\'^ + 
fOS]) onU = U{(t)) = 



"^2^ ^ 1; since ||?;||2 = 1 this implies l^sl > 1/2. Now test 
a similar argument yields 



cos (j) sin (j> 

10 
— sin (f) cos (f) 



\uiVi + UsVsl <^ 1, lilt's — U3Vi\ ^ 1. 

Since < 1, |n3| < 1, \vi\ <^ 1 and {v^l > 1/2, this system implies 

^ 1, \us\ ^ 1. 



Similarly, testing on U = U {(f)) 



10 

cos (j> sin (i 
— sin (h cos r 



, the same argument yields 



|ti2| ^ 1, \U'i\ <^ 1. 



So we proved that |ni| <C 1, |ti2| ^ 1, jiisl ^ 1. But this is impossible since 

\\u\\2 = 1. 

We have thus shown that in ()A.16P the first option never holds, so the second 
must hold. In other words, we have deduced from (jA.lSP that 

(A.17) \ul + ul\<^l. 

Using a similar argument (for rotations U in coordinates 1,3 and 2,3) we can also 
deduce that 

(A.18) \ul + ul\<^l, |tii + n^|<l. 

Inequalities ()A.17p and (jA.lSP imply that 

\ul\ < 1, \ul\ < 1, \ul\ < 1. 

But this contradicts the identity ||u||2 = 1. 

This shows that (lA.lSp is impossible, for a suitable choice of absolute constant 

Co- 



A. 3. 5. Conclusion of the proof. Let us recall the logic of the argument above. We as- 
sumed in (IA.6P that B + U \s poorly invertible with significant probability, 
P{gmin(^ + U) <t} > p{5,t)- With the choice p{5,t) = C{t/6y, K = A{5/tY/^ 
made in ()A.14p . we showed that either ()A.9P holds (in the case S2 < K), i.e. 
^ {sm\a{B + U) < t} < C{tK/5Y, or a contradiction appears (in the case S2 > K). 
Therefore, one always has 

F{smin(5 + U)<t}< max(p(5,t), C(t/^/<5)"). 



Due to our choice oi p{5,t) and K, the right hand side is bounded by C{tK / 5Y^'^ . 
This completes the proof of Theorem 14.11 □ 
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Appendix B. Some tools used in the proof of Theorem 11.31 

In this appendix we shall prove auxiliary results used in the proof of Theorem 1 1.31 
These include: Lemma lB.2l on small ball probabilities for Gaussian random vectors 
(which we used in the proof of Lemma 14. 8p , Lemma 14.51 on invertibility of Gaussian 
perturbations, and Lemma 14.61 on breaking complex orthogonality by a random 
change of basis. Some of the proofs of these results follow standard arguments, but 
the statements are difficult to locate in the literature. 

B.l. Small ball probabilities. 

Lemma B.l. Let X ~ A'K(/i, cj^) for some ^ £ W, a > 0. Then 

¥{\X\<ta}<t, t>0. 

Proof. The result follows since the density of X is bounded by l/(7\/27r. □ 

Lemma B.2. Let Z ~ N^{fi,a In) for some fi £ and Then 

¥{\\MZ\\2<ta\\M\\us} <Ct^/^, t > 0. 

Proof. By rescaling we can assume that a = 1. 

First we give the argument in the real case, for /i G M", M G M'^^'^. Let MJ 
denote the i-th row of M, and let /i = {ni, . . . , /i^)- Choose i G [n] such that ||Mj||2 > 
\\M\\ns/\^- Note that Z ~ N^{i^i, ||Mj|||) for some Ui G M". Lemma ED yields 
that 

p||m7^| < rllMilla} < Ct, t > 0. 

Since \\MZ\\2 > \MJ Z\ and ||-/V/j||2 > ||M||hs/\Aii this quickly leads to the comple- 
tion of the proof. 

The complex case can be proved by decomposing /x and M into real and imaginary 
parts, and applying the real version of the lemma to each part separately. □ 

B.2. Invertibility of random Gaussian perturbations. In this appendix we 
prove Lemma l4.5[ 

First we note that without loss of generality, we can assume that m = 18. Indeed, 
since / is linear it can be represented as 

where Oij and bij are some fixed vectors in M"*. By rotation invariance of Z, the 
joint distribution of the Gaussian random variables oJjZ and bJjZ is determined by 
the inner products of the vectors Oij and bij. There are 18 of these vectors; so we 
can isometrically realize them in M}^. It follows that the distribution of f{Z) is 
preserved, and thus we can assume that m = 18. 

Let > 1 be a parameter to be chosen later. By a standard Gaussian concentra- 
tion inequality, \\Z\\2 < R with probability at least 1 — 2exp(— ci2^). On this event, 
the matrix in question is well bounded: ||/ + /(^)|| < 1 + ||/(Z)||hs < 2KR, and 
consequently we have 

I det(/ + f{Z))\ < Snunil + f{Z)) • {2KRf. 



'This means that X — /j. is real valued variable distributed according to N{0, 
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Therefore we can estimate the probabihty in question as follows: 

(B.l) P{s,nin(I+/(^)) <i} 

< P{|det(/ + /(Z))| < {2KRft, \\Z\\ < /?} + 2exp(-ci?2). 

Since / is linear, | det(/ + f{Z))\'^ is a real polynomial in Z £ of degree 6, 
and thus we can apply Remez inequality, Theorem lA.ll We are interested in the 
Gaussian measure of the set 

E:={Z e M^^ : | det(/ + /(Z))| < {2KRft, \\Z\\ < R} 

which is a subset of 

V :={Z e M^^ : \\Z\\ < R}. 
The conclusion Theorem lA.ll is in terms of the Lebesgue rather than Gaussian 
measures of these sets: 

I det(/ + MZ)|2 < (^M^) ■ {{2KR)Hf for all Z eV. 



1 < ( ^llM ) . {2KRft, 



Taking square roots and substituting Z = in this inequality, we obtain 

1^1 
thus 

1^1 < Ci\V\ ■ {{2KR)hf''^ < C2R^^ ■ {{2KRftY'^, 
where the last inequality follows from the definition of V . Further, note that the 
(standard) Gaussian measure oi E is bounded by the Lebesgue measure \E\, because 
the density is bounded by density (27r)~^ < 1. Recalling the definition of E, we have 
shown that 

P{|det(/ + /(Z))| < {2KRft, \\Z\\ < R] < C2R^^ ■ {{2KRft)^/^ . 

Substituting this back into (jB.ip . we obtain 

P{|det(/ + /(Z)) <t}< C2R^^ ■ {{2KRft)^/^ + 2exp{-cR^). 

Finally, we can optimize the parameter R > 1, choosing for example R = t-^/iooo 
to conclude that 

P {| det(/ + f{Z)) <t}< C-iK^l^t^l^. 
This completes the proof of Lemma 14.51 □ 



B.3. Breaking complex orthogonality. In this section we prove Lemma 14.61 
about breaking complex orthogonality by a random change of basis. 

We will present the argument in dimension n = 3; the dimension n = 2 is very 
similar. Without loss of generality, we can assume that t < 1/2. Note that by 
assumption, 

\\B\\ < ||T||||L>|| < A"||T||. 
Then the probability in the left side of (|4.3ip is bounded by 

F^WBB'^ - I\\ < K^llrf t| = P |||f f - /|| < K^llrf i| , where f = TQD. 

We can pass to Hilbert-Schmidt norms (recall that all matrices are 3x3 here) and 
further bound this probability by 

P{||ff"^-/||Hs <3K2||T||^s4- 
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Assume the conclusion of the lemma fails, so this probability is larger than 
C{tK'^ / 5Y . We are going to apply Remez inequality and conclude that — /|| is 

small with probability one. Recalling the Hurwitz description of a uniform random 
rotation Q G 50(3) which we used in Section rA.3.2l we can parameterize Q by a 
uniform random point on the real torus T3 = S"^ x S'^ C M^. Under this parametriza- 
tion, (||rr"'" — /||Hs/3i^^||r||^g)^ becomes a polynomial in five variables and with 
constant degree restricted to T3. 

Our assumption above is that this polynomial is bounded by on a subset of T3 
of measure larger than C{tK'^ / 5Y . Then the Remez-type inequality for the torus 
(jA.ip implies that the polynomial is bounded on the entire T3 by 



\C{tK'^/5YJ - V 10^X2 

where the last inequality follows by a suitable choice of a large absolute constant C 
and a small absolute constant c in the statement of the lemma. This means that 

||Tf T - IIIhs < m^ml^ ■ < frills for all Q G S0{3). 

There is an entry of T such that |Tjj| > i||T||HS- Since the conclusion of the 
Lemma is invariant under permutations of the rows of T, we can permute the rows 
in such a way that Tij is on the diagonal, i = j. Furthermore, for simplicity we can 
assume that i = j = 1; the general case is similar. We have 

(B.2) |(ff)7i-l|<^||r||2js forallQG50(3). 

We shall work with Q of the form Q = Q1Q2 where Qi,Q2 G SO{3). We shall use 
Qi to mix the entries of T and Q2 to test the inequality (IB.2p . Let Qi = Qi{<p) = 

cos 4> sin (p 

- sin cost/) , (/> G [0, 2tt], and consider the matrix 
1 J 

G:=TQi. 

Since Gn = Tn cos (p — ^12 sin (j) and G12 = Tn sin (j) + T12 cos (p, one can find (p (and 
thus Qi) so that 

1,^ ,2 1 
-Til > — 



(B.3) |G'?i-G?2|>-|Tn|2>-||T||^s. 



Recall that f = TQ1Q2D = GQ2D. Substituting into inequality (lR2ll Q 



2 



100 
1 
1 



and Q 



2 



010 
100 

00-1 



we obtain 



\dlGli + dlC^a + d^Gl^ - 1| < ■^II^'IIhs; 



2 2 2 2 2 2 1 1 2 

^1^12 + ^2^11 + ^3^13 — 1| < 5QQ II-^IIhS- 

We subtract the second inequality from the first and conclude that 

(B.4) 1(^2 _^2)(^2^_ ^2^)1 <_^||3.||2^^_ 

On the other hand, recall that 1^1 — ^2! — ^ assumption and \Gii — Gi2\ > 
^||r||^g by jES]). Hence -0^2)! > ^ll^'llls- This contradicts (IBa . 

The proof of Lemma 14.61 is complete. □ 
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